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REMARKS 

Claims 31-35 were previously pending and are amended herein. Claims 36-38 have been 
added. Support for the newly added claims can be found, for example, in paragraph 0137 of the 
application and in Figure 17. No new matter has been added. 

Rejections Under 35 U.S.C. §112, first paragraph (written description) 

The Examiner rejected claims 33-35 as allegedly failing to meet the written description 
requirement. The Examiner stated that the claims encompass variants of gag, pol and env. In 
particular the Examiner notes that the claims encompass variants of HIV pol having reduced 
reverse transcriptase activity, reduced strand transfer activity or reduced RNaseH activity. In 
making this rejection, the Examiner cited: Fiers v. Revel, Amgen, Inc. v. Chugai 
Pharmaceuticals Co. Ltd, Fiddes v. Baird and University of California v. Eli Lilly and Co. 
Applicants respectfully traverse this rejection. 

The Examiner acknowledges that the present application discloses an HIV pol sequence 
having a mutation that reduce reverse transcriptase activity, a mutation that reduces strand 
transfer activity and a mutations that reduces RNaseH activity. However, the Examiner argues 
that the disclosure of these mutations is inadequate to meet the written description requirement 
for the claimed genus of variants. Applicants disagree. 

The USPTO Written Description Requirement Guidelines explain that the written 
description requirement can be met by a "show[ing] that an invention is complete by disclosure 
of sufficiently detailed, relevant identifying characteristics . . . i.e., complete or partial structure, 
other physical characteristics when coupled with a known or disclosed correlation between 
function and structure, or some combination of such characteristics." Guidelines for Examination 
of Patent Applications Under 35 U.S.C 112 Paragraph 1, "Written Description" Requirement. 
66 Fed. Reg. 1099, 1 106. In Enzo Biochem, Inc. v. Gen-Probe Inc., the Court of Appeals for the 
Federal Circuit quoted this portion of the Guidelines with approval stating that "we are 
persuaded by the Guidelines on this point and adopt the PTO's applicable standard for 
determining compliance with the written description requirement." 296 F.3d 1261 (Fed. Cir. 
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2002). Thus, whether a disclosure meets the written description required for a claimed 
invention, e.g., a claimed protein or variant protein depends, in part on the knowledge of those 
skilled in the art regarding the claimed protein or protein variant. For example, the existence of a 
known correlation between structure and function is a relevant factor in assessing whether the 
written description requirement has been met. 

The HIV genome and the proteins expressed by HIV are extremely well studied. In 
particular, the genomes of many different HIV strains from a variety of clades have been 
sequenced in whole or in part. This is true today and it was true as of the priority date of the 
present application. In addition, the structure and function of many HIV proteins have been 
studied in detail. HIV Pol has been particularly intensively studied because it includes the 
protease and reverse transcriptase functions that are the targets of the major therapeutic 
treatments for AIDS. 

The present application explains that GenBank® and various other sequence databases, 
such as that found on the internet at http://hiv-web.lanl.gov provide HIV sequences (see 
paragraph 0074 of the present specification). As of the priority date of the present application 
hundreds of different HIV strains had been sequenced, in whole or in part. The "HIV Sequences 
Compendium" is published annually by the Theoretical Biology and Biophysics Group, Los 
Alamos National Laboratory. This compendium provides the sequence of numerous HIV 
genomes and an even larger number can be found at a related web site (http://hiv-web.lanl.gov) . 
Applicants have enclosed pages i-viii and pages 457 to 527 of the "HIV Sequences Compendium 
2000", Kuiken et al., eds., published by the Theoretical Biology and Biophysics Group, Los 
Alamos National Laboratory (Exhibit A). This version of the Compendium includes HIV 
sequences published up until the end of 2000. Pages 457 to 527 of the "HIV Sequences 
Compendium 2000" provide an alignment of the amino acid sequence of the HIV-1 proteins 
(along with a few SIV cpz proteins). As explained on page 457 of the Compendium, the 
envelope master sequence alignment alone includes 307 full-length sequences. The 
Compendium itself includes a representative selection of the alignments. The alignments for 
HIV pol begin on page 480. As the Examiner can see, Nearly 100 HIV pol sequences from 
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various clades are aligned with the HXB2 sequence (This is the HIV pol sequence used as the 
basis for the recombinant MVA described in Example 2 (paragraphs 0136-0154) of the present 
application). The conserved and variant amino acids are shown and consensus sequences for 
various clades are presented. Along the top of the pol alignment various functional domains and 
landmarks within pol, for example, the domains associated with RNaseH activity and integrase 
activity, are identified. The Compendium includes similar alignments for all of the other HIV 
proteins. The "HIV Sequences Compendium 2000" provides an alignment of a number of HIV 
pol amino acid sequences with the HIV HXB2 pol sequence. This is exactly the type of publicly 
available information that one can use to identify other HIV pol variants having a mutation that 
inhibits reverse transcriptase activity, RNaseH activity or strand transfer activity. It is apparent 
that many, many HIV protein sequences were known in the art as of the priority date of the 
present application and that alignment of the protein sequences permitted those in the art to 
identify conserved residues and functional domains. 

Numerous mutations within HIV pol that inhibit reverse transcriptase activity, RNaseH 
activity or strand transfer activity are known to those skilled in the art. For example, Snyder et 
al. (J. Virol 74:9668, 2000; Exhibit B) describes at mutation at amino acid 478 of HIV pol that 
interferes with strand transfer activity. Fan et al. {Biochemistry 35:9737, 1997; Exhibit C; under 
separate cover) Gao et al. (J. Mol Biol 277:559, 1998; Exhibit D; under separate cover) and 
Powell et al. (/. Biol Chem. 2772:13262, 1997; Exhibit E; under separate cover) describe HIV 
mutations that reduce RNaseH activity. Finally, Boyer et al. (Proc. Nat 'I Acad. ScL 97:3056, 
2000; Exhibit F) describes mutations that reduce reverse transcriptase activity. These references 
are just a sampling of the many publications describing mutations that inhibit the activity of HIV 
pol. 

As noted above, the present specification identifies mutations in HIV pol that lead to 
inhibition of reverse transcriptase activity, inhibition of strand transfer activity or inhibition of 
RNaseH activity. These mutations are located at amino acids 185, 266 and 478 of HIV strain 
HXB2 pol respectively (see paragraphs 0136 and 0137 of the present application). This 
disclosure in the context of the very large number of publicly available HIV pol sequences and 
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HIV pol mutations described in the scientific literature, is adequate to meet the written 
description requirement 

Given the large number of HIV sequences known to researchers, Applicants description 
of mutations in HIV HXB2 pol that lead to reduced reverse transcriptase activity, reduced strand 
transfer activity or reduced RNaseH is sufficient to meet the written description requirement of 
35 U.S.C. §112, first paragraph for the present claims. In view of this, Applicants respectfully 
request that the Examiner withdraw this written description rejection under 35 U.S.C. §112, first 
paragraph. 

Rejections Under 35 U.S.C. SI 12, first paragraph (new matter) 

The Examiner rejected claims 33-35 as allegedly containing new matter. The Examiner 
argued that the phrases: "reducing reverse transcriptase activity", "reducing strand transfer 
activity", and "reducing RNaseH activity" in claims 33-35, respectively, constitute new matter. 
Specifically, the Examiner stated that these phrases do not appear in the specification. The exact 
phrases used in a claim do not need to appear in the specification. Cordis Corp. v. Medtronic 
Ave, Inc., 339 F.3d 1352 (Fed. Cir. 2003) ("The disclosure as originally filed, does not, 
however, have to provide in haec verba support for the claimed subject matter at issue"). 
However, Applicants have amended claims 33-35 to recite: "inhibits reverse transcriptase 
activity", "inhibit strand transfer activity", and "inhibits RNaseH activity", respectively. Support 
for these limitations can be found, for example, in paragraph 0137 of the present specification. 

In view of the forgoing, Applicants respectfully request that the Examiner withdraw this 
new matter rejection under 35 U.S.C. §112, first paragraph. 

Rejections Under 35 U.S.C. SI 12, second paragraph 

The Examiner rejected claims 33-35 as allegedly indefinite. The Examiner argued that 
the terms "reverse transcriptase activity", stand transfer activity" and RNaseH activity" are 
indefinite. The Examiner ask which activity the various phrases refer to and in what biological 
context. 
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"If one skilled in the art would understand the bounds of the claim when read in the light 
of the specification, then the claim satisfies section 1 12, paragraph 2." Miles Labs., Inc. v. 
Shandon, Inc., 997 F.2d 870, 875 (Fed. Cir. 1993). It is Applicants' position that the phrases 
"reverse transcriptase activity", stand transfer activity" and "RNaseH activity" in the context of 
the present application would be understood by those of ordinary skill in the art. Moreover, as 
discussed at length above, HIV pol had been well studied at the time of priority date of the 
present application. Each of the activities mentioned in the claims had been characterized and 
mutations interfering with these activities had been identified. Thus, those skilled in art well- 
understood the various activities possessed by HIV pol, and the phrases "reverse transcriptase 
activity", stand transfer activity" and "RNaseH activity" in the context of the present application 
would be clear to those of ordinary skill in the art. 

In view of the forgoing, Applicants respectfully request that these rejections under 35 
U.S.C. §112, second paragraph be withdrawn. 

Rejections Under 35 U.S.C. $103 

The Examiner rejected claims 31-35 as obvious in view of Kent et al. (J. Virol. 72:10181, 
1998; "Kent") taken with Small et al. According to the Examiner, Kent discloses "priming with 
DNA and boosting with a recombinant Fowlpox virus wherein said DNA and recombinant 
Fowlpox encode (express) the gag, pol and env genes of HIV- 1" and Small discloses "that MVA 
constitutes a safe version of recombinant pox virus". The Examiner argued that it would have 
been obvious to "use the recombinant MVA disclosed by Small et al. in the method described by 
Kent et al, in order to take advantage of the increased safety associated with the use of MVA 
virus in immunization protocols." 

Applicants respectfully traverse this rejection. The MVA employed in the presently 
claimed methods expresses: HIV Gag, HIV Pol lacking the integrase domain, HIV gpl20 and 
HIV gp41, lacking or all or part of its cytoplamic domain." 

Kent describes two different recombinant fowlpox virus. One recombinant virus 
expresses HIV gag and HIV pol. The other recombinant poxvirus expresses HIV env. These are 
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described on page 10181 in the second paragraph under the heading "Recombinant Poxvirus". 
Thus, none of the recombinant poxvirus used by Kent expressed HIV gag, HIV pol and any 
portion of HIV env. In addition, the HIV pol-expressing poxvirus used by Kent apparently 
expressed intact HIV pol, not HIV pol lacking the integrase domain, as required by the present 
claims. Even if Small et al. suggests that MVA is a useful poxvirus for expressing heterologous 
genes, the combination of Kent and Small et al. would not lead one to the recombinant MVA 
used in the present methods. Thus, the cited references, no matter how combined, do not teach 
or suggest the limitations of claim 3 1 . Accordingly, the cited references do not render claim 3 1 
obvious. 

Regarding claims 33-35, which specify that the HIV pol expressed by the recombinant 
MVA lacks certain activities, the Examiner states that "it is deemed (in the absence of evidence 
to the contrary) that the recited genes would not function properly in the context of the claimed 
recombinant virus due to the lack of normal accessory proteins, enhancer etc." Applicants 
respectfully request that the Examiner provide a more complete explanation of the rejection of 
these claims. The HIV pol expressed is intended to be an antigen, not a functional HIV pol. 
Indeed, mutations can be desirable to reduce enzymatic activity that supports HIV replication. In 
any event, claims 33-35 all depend from claim 31. As explained above, the cited references, no 
matter how combined do not teach or suggest the limitations of claim 3 1 . According, the cited 
references cannot render any of the dependent claims (claims 32-38) obvious. 

In view of the forgoing, Applicants respectfully request that the rejections under 35 
U.S.C. §103 be withdrawn. 
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Enclosed is a Petition for Extension of Time with the appropriate fee. Please apply any 
other charges or credits to deposit account 06-1050. 
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Introduction 



This Compendium is an annual printed summary of the data contained in the HIV sequence 
database. In these compendia we try to present a judicious selection of the data in such a way that it is of 
maximum utility to HIV researchers. Traditionally, we present the sequence data themselves in the form 
of alignments: a comprehensive alignment of a selection of all full-length genomes the database contains 
(a lot of LAI-like sequences, for example, have been omitted because they are so similar that they bias 
the alignment) of HI V- 1 /SI Vcpz (Section I) and a combined HIV- 1 /HI V-2/SI V whole genome alignment 
(Section II); amino acid alignments for HIV-l/SIV-cpz, HIV-2/SIV, and SIVagm. The HIV-2/SIV and 
SlVagm amino acid alignments are separate because the genetic distances between these groups are so 
great that presenting them in one alignment would make them very elongated because of the large number 
of gaps that have to be inserted. As always, tables with extensive background information gathered from 
the literature accompany the whole genome alignments. 

The collection of whole-gene sequences in the database is now large enough that we have 
abundant representation of most subtypes (excluding H and J). For most other subtypes, and especially 
for subtype B, a large number of sequences that span entire genes were not included in the printed 
alignments to conserve space. A more complete version of all alignments is available on our website, 
http://hiv-web.lanl.gov Importantly, all these alignments have been edited to include only one sequence 
per person, based on phylogenetic trees that were created for all of them, as well as the literature. At the 
request of many users, we have re-inserted the consensus sequences for each subtype, unless there are 
fewer than five sequences representing it. In the alignments we have also included the Circulating 
Recombinant Forms', mosaic genomes that have epidemiological significance (see the nomenclature 
chapter for more on CRFs). Finally, for all amino acid alignments we have decided to combine the 
annotation tables into one, because of the increasing redundancy in the separate tables. In addition to 
sequence information (accession numbers, references) the new table lists which regions of the sequence 
are represented in the alignments. 

We have made an effort to bring the HIV-2/SIV and SIVagm alignments up-to-date as well. We 
have created an entirely new HIV-1/HIV-2/SIV alignment that is much improved over the previous 
version; it can be accessed via our website. Because of the frequency of redundant information, we have 
decided to merge the gene tables into one large table for each alignment section; we hope you will find 
these tables easy to use. 

In the Reviews section, along with the previously mentioned chapter on HIV nomenclature, you 
will find a very clearly written and concise overview of the functions of TAT; a review of lentivirus 
phylogeny and evolution; an overview of recombinants and their role in the epidemic; a concise and lucid 
chapter on protein search tools on the internet; an overview of HIV-1 coreceptors and coreceptor 
inhibitors; and a very thorough review on progress in SIV and HIV vaccines. In addition, we present 
updated versions of the customary reviews of coreceptor usage, drug resistance, and SIV/SHIV vaccine 
reagents. Reprints of all reviews are available from our website in the form of both HTML and PDF files. 

As always, we are open to complaints and suggestions for improvement. With the effort that goes 
into producing these volumes, we sincerely hope they will be widely used by the research community. 
Inquiries and comments regarding the Compendium should be addressed to: 
Dr. Carla Kuiken 

Theoretical Division, T-10, MS K710, LANL, Los Alamos, NM 87545 
Ph: (505)-665— 6463; fax: (505)-665— 3493; e-mail: kuiken@tlO.lanl.gov 
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HIV/SIV PROTEINS 



NAME 


SIZE 


FUNCTION 


LOCALIZATION 


Gag MA 


pl7 


membrane anchoring; env interac- 


virion 






tion; nuclear transport of viral core. 








(myristylated protein) 




CA 


p24 


core capsid 


virion 


NC 




nucleocapsid, binds RNA 


virion 




P 6 


binds Vpr 


virion 


Protease (PR) 


pl5 


gag/pol cleavage and maturation 


* • 

virion 


Reverse tran- 


p66 


reverse transcription, RNase H 


virion 


scriptase (RT), 


p51 


activity 




RNaseH 








Integrase (IN) 




JJINA provirus integration 


VlliUll 


Env 


gpl20 


external viral glycoproteins bind to 


plasma membrane, virion enve- 




gp41 


CD4 and secondary receptors 


lope 


Tat 


P 16/pl4 


viral trancriptional transactivator 


primarily in nucleolus/nucleus 


Rev 


pl9 


RNA transport, stability and utiliza- 


primarily in nucleolus/nucleus 




tion factor (phosphoprotein) 


shuttling between nucleolus and 








cytoplasm 


Vif 

Til. 


p23 


promotes virion maturation and in- 


cytoplasm (cytosol, membranes) 






fectivity 


virion 


Vpr 


plO-15 


promotes nuclear localization of 


virion, nucleus (nuclear mem- 


preintegration complex, inhibits cell 


brane?) 






division, arrests infected cells at 








G2/M 




Vpu 


pl6 


promotes extracellular release of 


integral membrane protein 




viral particles; degrades CD4 in the 








ER; (phosphoprotein only in HIV-1 








and SIVcpz) 




Nef 


p27-p25 


CD4 and class I downregulation 


plasma membrane, cytoplasm 






(myristylated protein) 


(virion?) 


Vpx 


pl2-16 


vpr homolog (not in HIV-1, only in 


virion (nucleus?) 




HIV-2 and SIV) 





vi Landmarks of the Genome 

LANDMARKS: 

HIV GENOMIC STRUCTURAL ELEMENTS 

LTR Long terminal repeat, the DNA sequence flanking the genome of integrated proviruses. It contains 
important regulatory regions, especially those for transcription initiation and polyadenylation. 

TAR Target sequence for viral transactivatiori, the binding site for Tat protein and for cellular proteins; 
consists of approximately the first 45 nucleotides of the viral mRNAs in HIV-1 (or the first 100 
nucleotides in HIV-2 and SIV.) TAR RNA forms a hairpin stem-loop structure with a side bulge; 
the bulge is necessary for Tat binding and function. 

RRE Rev responsive element, an RNA element encoded within the env region of HIV-1 . It consists of 
approximately 200 nucleotides (positions 7327 to 7530 from the start of transcription in HIV-1, 
spanning the border of gpl20 and gp41). The RRE is necessary for Rev function; it contains 
a high affinity site for Rev; in all, approximately seven binding sites for Rev exist within the 
RRE RNA. Other lentiviruses (HIV-2, SIV, visna, CAEV) have similar RRE elements in similar 
locations within env, while HTLVs have an analogous RNA element (RXRE) serving the same 
purpose within their LTR; RRE is the binding site for Rev protein, while RXRE is the binding site 
for Rex protein. RRE (and RXRE) form complex secondary structures, necessary for specific 
protein binding. 

CRS Cis-acting repressive sequences postulated to inhibit structural protein expression in the absence 
of Rev. One such site was mapped within the pol region of HIV-1. The exact function has not 
been defined; splice sites have been postulated to act as CRS sequences. 

INS Inhibitory/Instability RNA sequences found within the structural genes of HIV-1 and of other 
complex retroviruses. Multiple INS elements exist within the genome and can act independently; 
one of the best characterized elements spans nucleotides 414 to 63 1 in the gag region of HIV-1 . 
The INS elements have been defined by functional assays as elements that inhibit expression 
posttranscriptionally. Mutation of the RNA elements was shown to lead to INS inactivation and 
up regulation of gene expression. 

GENES AND GENE PRODUCTS 

GAG The genomic region encoding the capsid proteins (group specific antigens). The precursor is the 
p55 myristylated protein, which is processed to pl7 (MAtrix), p24 (CApsid), p7 (NucleoCapsid), 
and p6 proteins, by the viral protease. Gag associates with the plasma membrane where the virus 
assembly takes place. The 55 kDa Gag precursor is called assemblin to indicate its role in viral 
assembly. 

POL The genomic region encoding the viral enzymes protease, reverse transcriptase and integrase. 
These enzymes are produced as a Gag-pol precursor polyprotein, which is processed by the viral 
protease; the Gag-pol precursor is produced by ribosome frameshifting at the C-terminus of gag. 

ENV Viral glycoproteins produced as a precursor (gpl60) which is processed to give a noncovalent 
complex of the external glycoprotein gp 120 and the transmembrane glycoprotein gp41. The 
mature gpl20-gp41 proteins are bound by non-covalent interactions and are associated as a 
trimer on the cell surface. A substantial amount of gpl20 can be found released in the medium. 
gpl20 contains the binding site for the CD4 receptor, and the seven transmembrane domain 
chemokine receptors that serve as co-receptors for HIV-1. 

TAT Transactivator of HIV gene expression. One of two essential viral regulatory factors (Tat and 
Rev) for HIV gene expression. Two forms are known, Tat-1 exon (minor form) of 72 amino 
acids and Tat-2exon (major form) of 86 amino acids. Low levels of both proteins are found 
in persistently infected cells. Tat has been localized primarily in the nucleolus/nucleus by 
immunofluorescence. It acts by binding to the TAR RNA element and activating transcription 
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initiation and/or elongation from the LTR promoter. It is the first eukaryotic transcription factor 
known to interact with RNA rather than DNA and may have similarities with prokaryotic anti- 
termination factors. Extracellular Tat can be found and can be taken up by cells in culture. 

REV The second necessary regulatory factor for HIV expression. A 19 kD phosphoprotein, localized 
primarily in the nucleolus/nucleus, Rev acts by binding to RRE and promoting the nuclear 
export, stabilization and utilization of the viral mRNAs containing RRE. Rev is considered the 
most functionally conserved regulatory protein of lentiviruses. Rev cycles rapidly between the 
nucleus and the cytoplasm. 

VIF Viral infectivity factor, a basic protein of typically 23 kD. Promotes the infectivity but not the 
production of viral particles. In the absence of Vif the produced viral particles are defective, while 
the cell-to-cell transmission of virus is not affected significantly. Found in almost all lentiviruses, 
Vif is a cytoplasmic protein, existing in both a soluble cytosolic form and a membrane-associated 
form. The latter form of Vif is a peripheral membrane protein that is tightly associated with the 
cytoplasmic side of cellular membranes. Some recent observations suggest that Vif functions 
late in replication to modulate assembly, budding, and/or maturation the N-terminal half of Vif 
(N'-Vif) specifically interacts with viral protease. 

VPR Vpr (viral protein R) is a 96-amino acid (14 kd) protein, which is incorporated into the virion. 
It interacts with the p6 gag part of the Pr55 gag precursor. Vpr detected in the cell is localized 
to the nucleus. Proposed functions for Vpr include the targeting the nuclear import of preinte- 
gration complexes, cell growth arrest, transactivation of cellular genes, and induction of cellular 
differentiation. It is found in HIV-1, HIV-2, SIVmac and SIVmnd. It is homologous to the vpx 
protein. 

VPU Vpu (viral protein U) is unique to HIV-1 and SIVcpz, a close relative of HIV-1. There is no 
similar gene in HIV-2 or other SIVs. Vpu is a 16-kd (81 -amino acid) type I integral membrane 
protein with at least two different biological functions: (a) degradation of CD4 in the endoplasmic 
reticulum, and (b) enhancement of virion release from the plasma membrane of HIV- 1 -infected 
cells. Env and Vpu are expressed from a bicistronic mRNA. Vpu probably possesses an N- 
terminal hydrophobic membrane anchor and a hydrophilic moiety. It is phosphorylated by 
casein kinase II at positions Ser52 and Ser56. Vpu is involved in env maturation and is not found 
in the virion. Vpu has been found to increase susceptibility of HIV-1 infected cells to Fas killing. 

NEF A multifunctional 27-kd myristylated protein produced by an ORF located at the 3' end of the 
primate lentiviruses. Other forms of Nef are known, including nonmyristylated variants. Nef is 
predominantly cytoplasmic and associated with the plasma membrane via the myristyl residue 
linked to the conserved second amino acid (Gly). Nef has also been identified in the nucleus 
and found associated with the cytoskeleton in some experiments. One of the first HIV proteins 
to be produced in infected cells, it is the most immunogenic of the accessory proteins. The nef 
genes of HIV and SIV are dispensable in vitro, but are essential for efficient viral spread and 
disease progression in vivo. Nef is necessary for the maintenance of high virus loads and for 
the development of AIDS in macaques, and viruses with defective Nef have been detected in 
some HIV-1 infected long term survivors. Nef downregulates CD4, the primary viral receptor, 
and MHC class I molecules, and these functions map to different parts of the protein. Nef 
interacts with components of host cell signal transduction and clathrin-dependent protein sorting 
pathways. It increases viral infectivity. Nef contains PxxP motifs that bind to SH3 domains 
of a subset of Src kinases and are required for the enhanced growth of HIV but not for the 
downregulation of CD4. 

VPX A virion protein of 12 kD found only in HIV-2/SIVmac/SIVsm and not in HIV-1 or SIVagm. 
This accessory gene is a homolog of HIV-1 vpr, and HIV-2/SIV carry both vpr and vpx. Vpx 
function in relation to xpr is not fully elucidated; both are incorporated into virions at levels 
comparable to gag proteins through interactions with Gag p6. Vpx is necessary for efficient 
replication of SIV in PBMCs. Progression to AIDS and death in SIV-infected animals can occur 
in the absence of Vpr or Vpx. Double mutant virus lacking both vpr and vpx was attenuated, 
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whereas the single mutants were not, suggesting a redundancy in the function of Vpr and Vpx 
related to virus pathogenicity. 

STRUCTURAL PROTEINS/VIRAL ENZYMES The products of gag, pol and env genes, which are 
essential components of the retroviral particle. 

REGULATORY PROTEINS Tat and Rev proteins of HIV/SIY and Tax and Rex proteins of HTLVs. 
They modulate transcriptional and posttranscriptional steps of virus gene expression and are essential 
for virus propagation. 

ACCESSORY OR AUXILIARY PROTEINS Additional virion and non-virion- associated proteins 
produced by HIV/SIV retroviruses: Vif, Vpr, Vpu, Vpx, Nef. Although the accessory proteins are in 
general not necessary for viral propagation in tissue culture, they have been conserved in the different 
isolates; this conservation and experimental observations suggest that their role in vivo is very important. 
Their functional importance continues to be elucidated. 

COMPLEX RETROVIRUSES Retroviruses regulating their expression via viral factors and express- 
ing additional proteins (regulatory and accessory) essential for their life cycle. 
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Introduction 45 / 

Table of HIV-l/SIVcpz protein alignments 460 

Gag 470 

Pol 480 

Vif : • 498 

Vpr : 502 

Tat • 504 

Rev 506 

Vpu 508 

Env.. ■- 510 

Nef • 526 



Construction of HIV-l/SIVcpz protein alignments 

The number of full-length gene sequences is still growing rapidly for all genes. The envelope master 
alignment now contains 307 full-length sequences. For the purposes of the printed alignments, we have 
had to limit the number of sequences dramatically. Here we list the criteria we have followed to make 
the selection. 

First, we have decided to end the supremacy of the B clade sequences. More than half (1 98, to be precise) 
of the full-length envelope sequences are still subtype B, though the contribution of other subtypes is 
increasing. We have tried to balance the number of representatives of all subtypes in these alignments. 
For this, we had to make a heavy selection on subtype B sequences. We have tried to include as many 
"classical" sequences as possible. A lot of follow-up work has been done based on lab strains such as 
HXB2, MN, SF2, and JR-CSF/JR-FL, so these strains are included in the alignments. Furthermore, within 
subtype B we have tried to represent sequences from diverse geographical origins, so as to represent a 
broad spectrum of variants. In the case of subtype B, this means that we have included African, Asian 
and Brazilian variants along with the "Western" strains. For sequences from non-B subtypes, we have 
selected a few representative sequences from each dataset, again with an eye on maintaining geographical 
diversity. When possible we have left all representatives of group O in the alignment, as these sequences 
are much more genetically diverse that the subtypes. 
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Explanation of Symbols in Alignments 



Symbol Meaning 



Alignment symbols 

? in consensus no majority-rule consensus could be determined at this position 

x nucleotide missing from codon 

# frameshift, or codon contains N or illegal character 

$ stop codon 



Annotation symbols 



I- -I 

/ 

V 
V 
-> 

<- 
* 

AAA [NxS, NxT] 
A * A [NCS, NCT] 
CD4 
cds 

MHR 
nls 

phos site 

PKC 

Zn-motif 



domain boundaries 

protein start point 

protein end point 

splice site or exon join 

start of overlapping coding region 

end of overlapping coding region 

cysteine 

glycosylation site 
glycosylation site with cysteine 
residue critical for CD4 binding 

coding sequence (indicates regions where two proteins overlap; the 

overlapping proteins use two different reading frames) 

major homology region 

nuclear localization signal 

phosphorylation site 

protein kinase C binding 

Zinc finger binding motif 
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Sources of Annotation in the Alignments 



Prntpin 


Annotation 


Rpfprpnpp 


VJdg 


nhn<: ^itp Spr M 1 1 1 


Yu J Biol Chem 270*4792 f 19951 


Udg 


MHR (284-3021 


Ottpken T Virol 70*3407 H9961 


uag 


PvPa (70S-2411 


Rraatpn T Virol 70*4270 (19961 


uag 


vpi pdUvdglllg UUlllalll 


T ii T Virol 60-6873 (199^1 




T V *sl FO (48Q-4Q41 


KonHo T Virol 70- 1 S9 (\ Q961 


XT^f 

iNei 


myribiyidiioii, ^1-/^ 


Hnano T Virol 6Q-Q3 H 9QS1 
nudng, J viiui yjy.yj \iyyjj 


INei 


A ^T_I (~** /A /\Tt m *yi Aril 1 1 ofir\ti 

ivirii^ Qownmociuiauon, 


Pimi^t r\ 448 T-Tnrrmn Pptrm/imcpc anH ATD^ M QQQ1 
iiguci, p. Ht-o ouiiidii xvcuuvirusCd dim f\iuo \ iyyyj 




Plf ror»niitm«»nt ^7A.7QA 




INei 


nean 01 k^uh oinaing sue 


rlguei, p. H*fo nuiuall IvcirovirUaCa dllU r\lLJC> 1^1777^ 








Vfof 

iNei 


aClUlC ciusier, ^OU-0*tJ 


Pimift n 448 Human RptroviriiQps and AIDS (19991 


iNei 


\rXXr)j f \0/-f 0) 


riuang, J Virol Oy.yj \iyyO) 


XT*>{" 

INei 


DVp /7C QO\ 


Unnno T Virnl fiO-Q1 (1 QQS1 
riuang, J virui oy.yj \iyyjj 


INei 


poiypunne tract, \py-y / ) 


riuang, j viroi oy.yo \iyyo) 


Nef 


PAK binding, (103-105) 


Piguet, p. 448 Human Retroviruses and AIDS (1999) 


Nef 


Beta turn, (128-131) 


Huang, J Virol 69:93(1995) 


Nef 


PxxP, (145-148) 


Huang, J Virol 69:93 (1995) 


Nef 


COP1 recruitment 


Piguet, p. 448 Human Retroviruses and AIDS (1999) 




(152-153) 




Nef 


AP recruitment, (162-163) 


Piguet, p. 448 Human Retroviruses and AIDS (1999) 


Nef 


V-ATPase and Raf-1 


Piguet, p. 448 Human Retroviruses and AIDS (1999) 




binding, (172-173) 




Vpr 


alpha helix, (16-34) 


Cornelissen, ARHR 13:247 (1 997) 


Vpr 


H(S/F)RIG motifs, (71-82) 


Macreadie, PNAS USA 92:2770 (1995) 


Vpu 


all annotations 


Cornelissen, ARHR 13:247 (1997) 


Vpr 


LR domain, (60-82) 


Wang, Gene 178:7(1996) 
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Truncated tRNA-DNA mimics were examined in an in vitro assay for second-strand transfer during human 
immunodeficiency virus type 1 (HIV-1) reverse transcription. Strand transfer in this system requires the 
progressive degradation of the RNA within the 18-mer tRNA-DNA (plus-strand strong stop DNA) intermediate 
to products approximately 8 nucleotides in length. The ability of the truncated substrates to substitute for 
directional processing by RNase H or reverse transcriptase (RT) was examined. Using wild-type HIV-1 RT, 
substrates which truncated the 5' end of the tRNA primer by 6, 9, and 12 nucleotides (A6, A9, and A12, 
respectively) were recognized by RNase H and resulted in strand transfer. An overlap of 5 nucleotides between 
the acceptor and newly synthesized DNA template was sufficient for strand transfer. The mutant RT, E478Q 
correctly catalyzed the initial cleavage of the 18-mer tRNA-DNA mimic in the presence of Mn + ; however, no 
directional processing was observed. In contrast, no RNase H activity was observed with the A6, A9, and A12 
substrates with E478Q RT in this strand transfer assay. However, when complemented with Escherichia coli 
RNase H, E478Q RT supported strand transfer with the truncated substrates. E478Q RT did cleave the 
truncated forms of the substrates, A6, A9, and A12, in a poiymerase-independent assay. The size requirements 
of the substrates which were cleaved by the poiymerase-independent RNase H activity of E478Q RT are defined. 



Reverse transcription is a multistep process that is carried 
out by one virus-encoded enzyme, reverse transcriptase (RT). 
RT is a multifunctional enzyme which possesses RNA-depen- 
dent and DNA-dependent polymerase activities and RNase H 
activity. RNase H functions to remove RNA when it is present 
in an RNA-DNA hybrid (for a review, see reference 2). 

In the process of reverse transcription, the viral RNA is 
converted to double-stranded DNA, which is subsequently in- 
tegrated into the host genome. Human immunodeficiency virus 
type 1 (HIV-1) reverse transcription is initiated (3, 25-27, 33) 
using the cellular tRNA Lys * 3 as a primer (28, 35, 36). The first 
18 3' nucleotides of the tRNA primer are complementary to 
the primer binding site (PBS) sequence on the viral genome. 
Elongation of minus-strand synthesis pauses at the 5' terminus 
of the viral RNA (31), completion of which requires a strand 
transfer, referred to as minus-strand or first-strand transfer. 
Plus-strand synthesis is initiated at the polypurine tract and 
continues through cDNA synthesis of the first 18 nucleotides of 
the tRNA primer (2, 4, 7). The second-strand transfer requires 
removal of the tRNA primer (4, 46, 50, 58), which allows an 
acceptor PBS molecule to enter, and subsequent completion of 
viral DNA synthesis. 

HIV-1 RT consists of a heterodimer of two subunits, p66 
and p51 (14, 34). The p66 subunit consists of the polymerase 
and the RNase H domains; the p51 subunit lacks the RNase H 
domain. Based on resemblance of the crystal structure to a 
right hand (1, 30), the subunits have been further divided into 
subdomains: palm, finger, thumb, connection, and RNase H. 
Mutagenesis studies have identified interactions between the 
polymerase and RNase H domains. Mutations within the 
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thumb subdomain and the primer grip and deletions in the p51 
C terminus decrease RNase H activity (9, 18, 21, 44). 

During the viral life cycle, RNase H functions to degrade the 
viral genome, generate and remove the polypurine tract 
primer, and remove the tRNA primer. Removal of the tRNA 
primer has been extensively characterized for HIV-1 RT, 
Moloney murine leukemia virus RT, and an isolated HIV-1 
RNase H domain (48, 49, 59). RNase H activity has been 
classified as either polymerase dependent or polymerase inde- 
pendent (2). The polymerization active site is spatially sepa- 
rated from the RNase H active site; polymerase-dependent 
RNase H activity results in RNase H cleavages which lag ap- 
proximately 18 to 20 nucleotides behind the site of polymer- 
ization (55, 56). The catalytic residues of the RNase H active 
site are Asp 443, Glu 478, and Asp 498 (13, 17, 47). The 
requirements of RNase H activity during HIV-1 reverse tran- 
scription have been further characterized through the analysis 
of an RNase H-defective mutant, E478Q RT. This RNase H 
mutant possesses only Mn 2+ -dependent RNase H activity. Ad- 
ditionally, it is capable of only a single endoribonucleolytic 
cleavage and lacks the ability to further degrade the tRNA 
primer (10). 

Previously, we showed that RNase H activity is required for 
the HIV-1 second-strand transfer (50). More specifically, a 
single endoribonucleolytic cleavage is not sufficient to allow 
release of the tRNA primer. Rather, subsequent RNase H 
degradation must be carried out by RT or through comple- 
mentation with Escherichia coli RNase H. We have now inves- 
tigated the ability of HIV-1 RT and a mutant enzyme, E478Q 
RT, to support strand transfer with substrates possessing trun- 
cations in the 5' portion of the tRNA primer. These substrates 
have the potential to substitute for RNase H-catalyzed direc- 
tional processing, due to their decreased melting temperatures 
(T m ). If properly cleaved by E478Q RT, the truncated sub- 
strates could potentially support strand transfer without 
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complementation by E. coli RNase H. In polymerase-indepen- 
dent assays, the truncated substrates were recognized and 
cleaved by E478Q RT. However, the results indicate a differ- 
ential recognition by wild-type (WT) RT and E478Q RT on the 
truncated substrates in second-strand transfer reactions. Uti- 
lizing an in vitro strand transfer assay, we have shown that 5 
nucleotides of overlap between the newly synthesized DNA 
strand and the acceptor molecule is sufficient for strand trans- 
fer. 



MATERIALS AND METHODS 

Enzymes and nucleotides. [^- 32 PjATP was purchased from ICN. T4 polynu- 
cleotide kinase was purchased from New England Biolabs. RNasin was pur- 
chased from Promega. Klenow Exonuclease (-) was purchased from Boehringer 
Mannheim. HIV-1 RT was obtained either from Jeffrey Culp and Christine 
Debouch, Department of Protein Biochemistry, SmithKline Beecham Pharma- 
ceuticals (37), or from Stuart Le Grice. The two HIV-1 RT preparations dis- 
played equivalent specificity in the tRNA removal assay, indicating that varia- 
tions in the expression and purification schemes did not result in altered 
biochemical properties (data not shown). The presence of the histidine tag on the 
HIV-1 RT has been previously shown not to influence catalytic functions (32). 
HIV-1 RT E478Q mutant was obtained from Stuart F. Le Grice and the AIDS 
repository (contributor, Stuart F. Le Grice). HIV-l-isolated RNase H (NY427) 
was purified from E. coli containing plasmid pET-NY427 (52). E. coli RNase H 
was purchased from Gibco BRL. 

Oligonucleotides. The RNA-DNA hybrid oligonucleotides (RNA is indicated 
in bold) 17-mer (5' GUUCGGGCGCCACTGCT 3'), 14-mer (5' CGGGCGCCA 
CTGCT 3'), 11-mer (5' GCGCCACTGCT 3'), 17-mer (DNA) (5' AGCAGTG 
GCGCCCGAAC 3'), 14-mer (DNA) (5' AGCAGTG GCGCCCG 3'), 11-mer 
(DNA) (5' AGCAGTGGCGC 3'), HTD-1 (5' GTGTG G AAAATCTCTAGC A 
GTGGCGCCCCGAACAGGGA 3'), 17080 (5' ATCTCTAGCAGTGGCGCC 
CGAACAGGGAC 3'), and 17081 (5' GAAAATCTCTAGCAGTGGCGCCC 
GAACAGGGAC 3') were synthesized by Integrated DNA technologies. 
Oligonucleotides 5785 (5' CCCTCAGCCCTTTTAGTCAGTGTGG 3'), 5786 
(5' CCCTTTTAGTCAGTGTGGAAAATCTCTAGCAGTGGCGCCCGAAC 
AGGGACCTGAAAGCGA 3'), 5331 (5' AGCAGTGGCGCCCGAACGCGG 
GGCTTGTCCCT 3'), and 5580 (5' TTTCGCTTTCAGGTCCCTGTTCGGGC 
GCCA 3') were synthesized by the University of Medicine and Dentistry of New 
Jersey. 

Strand transfer substrate preparation. The truncated RNA-DNA hybrid 
strand transfer substrates were prepared as follows. Portions (20 pmol) of the 
17-mer, 14-mer, and 11-mer RNA-DNA hybrids were 5 '-end labeled using T4 
polynucleotide kinase and [7- 32 P]ATP. The radiolabeled RNA-DNA oligonu- 
cleotides were isolated utilizing G-25 spin columns (Boehringer Mannheim). The 
labeled substrates were annealed to 40 pmol of oligonucleotide 5786 in a 25-u.l 
reaction mixture containing 50 mM Tris-HCl (pH 8.0), 50 raM KC1, 8 raM 
MgCl 2 , and 2 mM dithiothreitol (DTT). The substrates were extended using 
Exonuclease (-) Klenow, isolated on gels, and eluted overnight as previously 
described (50). The product sizes for the input 17-mer, 14-mer, and 11-mer 
RNA-DNA hybrids are 44-mer, 41-mer, and 38-mer, respectively. The substrates 
were then annealed to oligonucleotide 5785 (referred to as 26-mer). This oligo- 
nucleotide was also 5'-end labeled in the same manner as described for the 
RNA-DNA hybrids. 

Strand transfer reactions. Reactions were performed as previously described 
(50). Briefly, the annealed RNA-DNA hybrid substrates were incubated with 
either HIV-1 RT or E478Q RT. The strand transfer reactions were performed in 
a 20-ui reaction mixture containing approximately 4 pmol of substrate (substrate 
refers to the 50-mer RNA-DNA oligonucleotide annealed to primer 5785), 4 
pmol of acceptor (oligonucleotide 5580), and 2 pmol of either HIV-1 RT or 
E478Q RT in a reaction buffer containing 20 mM Tris-HCl (pH 7.5), 50 mM KG, 
8 mM MgCl 2 or 8 mM MnCl 2 , 2 mM DTT, and 0.25 \jM each deoxynucleoside 
triphosphate (dNTP) (dATP, dCTP, dGTP, and TTP). Reactions were initiated 
upon the addition of enzyme and performed at 37°C. Aliquots (2.5 jjlI) were 
removed at the indicated time points. The reaction products were analyzed on a 
15% polyacrylamide denaturing gel and exposed to autoradiography film. 

In reactions in which multiple enzymes were used, such as E. coli RNase H, 1 
pmol of the additional enzyme was added after the 12-min time point. 

RNase H cleavage assays. Reactions were performed as previously described 
(49). Briefly, approximately 4 pmol of the indicated RNA-DNA hybrid substrate 
was incubated with either 1 pmol of HIV-1 RT or E478Q RT. RNase H cleavage 
substrates were prepared in the same manner as described for the strand transfer 
substrates. The annealing templates used for extension templates were oligonu- 
cleotides 5786, HTD-1, 17081, 17080, and 5531 for substrates B to F (see Fig. 6), 
respectively. These newly extended substrates were gel isolated and eluted as 
previously described (50). The substrates were again annealed to their corre- 
sponding extension templates. The reaction mixture (20 pi) contained 20 mM 
Tris-HCl (pH 7.5), 50 mM KC1, 8 mM MnCl 2 , and 2 mM DTT. Time course 
reactions were performed, and 3-u.l aliquots were removed at 0, 2, 5, 15, and 30 



min. Samples were analyzed on a 20% polyacrylamide denaturing gel and ex- 
posed to autoradiography film. 

RESULTS 

Strand transfer assays with truncated substrates. The sec- 
ond-strand transfer is a vital intermediary step in the synthesis 
of full-length double-stranded retroviral DNA. An in vitro 
assay has been developed to analyze the second-strand transfer 
reaction during HIV-1 reverse transcription (50). In this assay, 
model substrates were constructed which mimic the interme- 
diate formed during plus-strand strong stop synthesis in which 
the first 18 nucleotides of the tRNA primer are reverse tran- 
scribed (Fig. 1, step 2). In the removal of the tRNA, the initial 
cleavage occurs at the penultimate nucleotide at the 3' end of 
the tRNA (step 3) (51, 53). Previous studies have indicated 
that the tRNA needs to undergo further degradation for 
strand-transfer to proceed (50). Once the tRNA is removed, 
the acceptor molecule can enter and a 70-mer strand transfer 
product results (step 4). 

Previous analysis indicated the RNase H-defective mutant, 
E478Q RT, was capable of performing only a single endoribo- 
nucleolytic cleavage in removing the tRNA primer (10, 50). 
Using our model system, this cleavage yields a 17-mer RNase 
H cleavage product, which was not sufficient for strand transfer 
to proceed (50). Therefore, it was reasoned that decreasing the 
size of the RNA component would lower the T m of the RNA- 
DNA hybrid and could compensate for the lack of directional 
processing by E478Q RT. Model substrates were constructed 
that truncated the 5' terminus of the RNA by 6, 9, or 12 
nucleotides and were named A6, A9, and A12, respectively 
(Fig. 1). These truncated substrates were specifically recog- 
nized and cleaved at the expected position by both WT HIV-1 
RT and an isolated HIV-1 RNase H domain (49). 

Figure 1 summarizes the second-strand transfer assay with 
substrates possessing truncations in the 5' RNA portion. Step 
1 represents the input substrates for the A6, A9, and A12 
constructs, in which the RNA-DNA oligonucleotides are hy- 
bridized with plus-strand primer 5785. In the presence of 
dNTPs and WT HIV-1 RT, polymerization occurs (step 2) and 
the full-length plus-strand product is synthesized. In this com- 
plex, the RT is paused with the terminus of the nucleic acid 
substrate bound in the polymerase active site. Once the RNA- 
DNA hybrid is formed, RNase H can cleave the RNA primer 
(step 3). Using a full-length 18-mer RNA, at the completion of 
plus-strand synthesis, the E478Q RNase H active site is opti- 
mally positioned for the single endoribonucleolytic cleavage to 
occur through a polymerase-dependent mechanism. However, 
with the truncated substrates, a polymerase-independent 
RNase H cleavage would be required to remove the RNA, 
since in a polymerase-dependent assay, the RNase H active site 
would be positioned within a double-stranded DNA region. 
Once the RNA primer is removed, the acceptor molecule 
enters and a 70-mer strand transfer product is produced (step 
4). Decreasing the size of the RNA template affects the overlap 
between the newly synthesized DNA strand and the acceptor 
molecule. As the truncation increases, the amount of overlap 
between the acceptor and newly synthesized strand decreases. 
This assay therefore allows the determination of the minimum 
overlap sequence required for strand transfer to occur. 

HIV-1 RT assayed with truncated substrates. The truncated 
substrates (A6, A9, and A12) were assayed with WT HIV-1 RT 
(Fig. 2) in the presence of Mg 24 ", dNTPs, and acceptor mole- 
cule. Reactions performed in the presence of Mn 2+ yielded 
equivalent results (data not shown). Figure 2 represents time 
courses from 0 to 30 min. Lanes 1 to 6 represent HIV-1 strand 
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Step 1 



Substrates utilized 



U5 



PBS 



5785 3« 

* CCCTCAGAC CCTTTTAGTC AGTGTGG 

GGGAAAATCAGTCACACCTTTTAGAGATCGTCACCGCGGGCUUGUCCCUG* 
A6 GGGAAAATCAGTCACACCTTTTAGAGATCGTCACCGCGGGCOUG * 

A9 GGGAAAATCAGTCACACCTTTTAGAGATCGTCACCGCGGGC * 

A12 GGGAAAATCAGTCACACCTTTTAGAGATCGTCACCGCG * 



26-mer 
50-mer 
44-mer 
41-mer 
38-mer 



Step 2 



DNA polymerization 



5785 

CCCTCAGACCCTTTTAGTCAGTGTGG 



58/ 52 / 49 / 46-mer 

GGGAAAATCAGTC ACACCTTTTAGAGATCGTCACCGCGGGCUUGDCCCUQ * 50-mer 



A6 GGGAAAATCAGTCACACCTTTTAGAGATCGTCACCGCGGGCUUG 
A9 GGGAAAATCAGTCACACCTTTTAGAGATCGTCACCGCGGGC * 

A12 GGGAAAATCAGTCACACCTTTTAGAGATCGTCACCGCG * 



44-mer 
41-mer 
38-mer 



Step 3 



tRNA Removal by RNase H 



5785 

* CCCTCAGACCCTTTTAGTCAGTGTGG 



GGGAAAATCAGTCACACCTTTTAGAGATCGTCA 



58/ 52 / 49 / 46-mer 



CCGCGGGCUUGUCCCUG* 

CCGCGGGCUUG * 
CCGCGGGC * 
CCGCG * 



17-mer 

11-mer 
8-mer 
5-mer 



Step 4 



Strand Transfer 



5785 

* CCCTCAGACCCTTTTAGTCAGTGTGG 



3* Acceptor 5' 

ACCGCGGGCTTGTCCCTGGACTTTCGCTTT 



70-mer 



GGGAAAATCAGTCACACCTTTTAGAGATCGTCA 



FIG. 1. Second-strand transfer assay with model substrates. This illustrates the model strand transfer assay with the truncated substrate possessing only 12 (A6), 9 
(A9) and 6 (A12) positions of the RNA sequence. Step 1 illustrates the input substrate for each truncated substrate, along with their respective input RNA-DNA sizes. 
Step2 illustrates the polymerization reaction which can occur in the presence of RT and dNTPs and the size of the polymerization product for each substrate, 52-mer 
(A6), 49-mer (A9), and 46-mer (A12). DNA polymerization creates the RNA-DNA hybrid, which is a substrate for the RNase H domain (step 3). Once the RNA has 
been removed between the terminal ribo-A and ribo-C, the acceptor molecule can enter and produce a strand transfer product, 70-mer (step 4). In each step, the RNA 
portion is indicated in bold and the 5' radiolabel is indicated by an asterisk. The size of the strand transfer product (70-mer) would be the same for each truncated 
substrate. 



transfer reactions with the A6 construct. The plus-strand oli- 
gonucleotide, 5785, was quickly extended from a 26-mer to the 
52-mer product. The RNA portion of the RNA-DNA hybrid 
was degraded by the RNase H domain, and a strand transfer 
product (70-mer) was produced. Similarly, HIV-1 RT was ca- 
pable of polymerization, RNase H activity, and strand transfer 
on the A9 substrate (lanes 7 to 12). The A12 construct was also 
assayed with HIV-1 RT (lanes 13 to 18). This construct had the 
largest deletion, possessing only 6 nucleotides of the tRNA 
primer. Therefore, there was only 5 bp of overlap between the 
acceptor and the newly synthesized DNA strand to support 
strand transfer. This substrate also successfully produced a 
strand transfer product (70-mer), albeit at lower efficiency. 
This result indicated that a 5-nucleotide overlap between the 



newly synthesized DNA strand and the acceptor strand is suf- 
ficient to support strand transfer. 

E478Q RT assayed with the truncated substrates. The trun- 
cation substrates were assayed with E478Q RT in the same 
manner as for HIV-1 RT, except that reactions were per- 
formed in the presence of Mn 2+ . Previous analysis indicated 
that the addition of Mn 2+ to the WT HIV-1 RT does not 
change the RNase H or strand transfer properties of the en- 
zyme, allowing direct comparison of the WT and E478Q mu- 
tant RT (50). It had been postulated that the single E478Q 
RNase H cleavage on the A9 and A12 substrates would release 
8-mer and 5-mer RNA species, respectively. The reaction tem- 
perature is above the T m of the products and would allow 
dissociation of the RNA. The T m of the A6 construct, at ap- 
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Substrate 

H1V-1 RT 
Time 

Lane 



A6 A9 
+ + 

0 .5 2 8 15 30 0 .6 2 6 16 30 

1 2 3 4 6 6 7 8 0 10 11 12 



A12 
+ 

0 .6 2 6 16 30 
13 14 16161716 




RNase H 
activity 



FIG. 2. Truncated substrates assayed with HIV-1 RT. Reactions were per- 
formed as described in Materials and Methods. Lanes 1 to 6, 7 to 12, and 13 to 
18 represent HIV-1 RT incubated with the A6, A9, and A12 constructs, respec- 
tively. Time points are indicated above each lane in minutes. Strand transfer 
products (70-mer), DNA primer (26-mer), and RNase H products are indicated 
by arrows. Input substrates for the A6, A9, and A12 constructs are 44-mer, 
41-mer, and 38-mer, respectively. Initial RNase H cleavage products for the A6, 
A9, and A12 constructs are 11-mer, 8-mer, and 5-mer, respectively. 



proximately 42°C, may be too high for dissociation to occur. In 
previous studies, strand transfer correlated with the appear- 
ance of 8-mer RNA products (42, 50). Figure 3, lanes 1 to 6, 
lanes 7 to 12, and lanes 13 to 18 represent E478Q RT assayed 
with the truncations A6, A9, and A12 respectively. For all of the 
constructs, extension products were observed, indicative of 
complete synthesis. Interestingly, no RNase H cleavage prod- 
ucts were observed for any of the constructs assayed in the 
presence of E478Q RT. Reactions performed for up to 40 min 
and/or in the presence of both divalent cations did not yield 
RNase H cleavage products (data not shown). A band was 
observed with the A6 construct (lanes 2 to 6). However, this 
band was also present in the absence of enzyme (lane 1) and is 
therefore not an RNase H cleavage product. These results 
were surprising because previously, E478Q RT was capable of 
producing an RNase H cleavage product on the intact full- 
length substrate (50). Without any RNase H activity, E478Q 
RT was unable to proceed with strand transfer. It was there- 
fore possible that E478Q RT was not capable of catalyzing the 
polymerase-independent RNase H cleavages required for 
strand transfer in this modified assay. 

Complementation of HIV-1 RT and E478Q RT with £. coli 
RNase H. To further characterize the defects of E478Q RT 
with regard to its RNase H cleavage, it was necessary to de- 
termine that this mutated enzyme could perform strand trans- 
fer under these modified conditions with the truncated sub- 
strates. Therefore, both HIV-1 RT and E478Q RT were 
complemented with E. coli RNase H in a strand transfer reac- 
tion. The reactions were performed in the presence of Mg 24 ", 
dNTPs, and acceptor; E. coli RNase H was added after the 
12-min time point for both enzymes. Under these conditions, 
both the WT HIV-1 RT and E. coli RNase H were active and 



their products could be distinguished. The WT HIV-1 RT 
products (Fig. 4A, lanes 3, 9, and 15) were similar to those 
identified in Fig. 2 (lanes 3, 9, and 15). Addition of £. coli 
RNase H after 12 min resulted in much more extensive RNase 
H degradation (Fig. 4, lanes 4, 10, and 16, indicated by vertical 
arrows). For E478Q RT, the visible cleavage activity can only 
be a result of E. coli RNase H since the reactions were per- 
formed in Mg 2+ alone. The results of time course analyses 
using HIV-1 RT and E478Q RT are shown in Fig. 4A and B, 
respectively. Complementation of HIV-1 RT with E. coli 
RNase H was similar to the results (lanes 1 to 6, lanes 7 to 12, 
and lanes 13 to 18) with HIV-1 RT alone. A 70-mer strand 
transfer product was produced in each case. With HIV-1 RT, 
only a single RNase H cleavage product was observed for the 
A12 construct (Fig. 2). However, addition of E. coli RNase H 
yielded RNA products as small as a diribonucleotide (Fig. 4A, 
lane 16). 

Complementation of E478Q RT with E. coli RNase H on 
the A6, A9, and A12 substrates (Fig. 4B, lanes 1 to 6, lanes 7 to 
12, and lanes 13 to 18, respectively) did indeed result in the 
70-mer strand transfer product. These results confirmed the 
requirement of RNase H activity for strand transfer to take 
place. These experiments also reinforce the finding with HIV-1 
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FIG. 3. Truncated substrates assayed with E478Q RT. Reactions were per- 
formed as described in Materials and Methods. Lanes 1 to 6, 7 to 12, and 13 to 
18 represent E478Q RT assayed with the A6, A9, and A12 constructs, respec- 
tively. Time points are indicated above each lane in minutes. Strand transfer 
products (70-mer), DNA primer (26-mer), and RNase H products are indicated 
by arrows. Input substrates for the A6, A9, and A 12 constructs are 44-mer, 
41-mer, and 38-mer, respectively. Initial RNase H cleavage products for the A6, 
A9, and A12 constructs are 11-mer, 8-mer, and 5-mer, respectively. 
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FIG. 4. (A) Complementation of HIV-1 RT with E. coli RNase H. Reactions were performed as described in Materials and Methods. Input RNA-DNA, DNA 
primer, and RNase H cleavage products are indicated by arrows. Reactions were allowed to proceed for 12 min in the presence of Mg 2 "*", and then of E. coli RNase 
H was added (indicated by the vertical arrows). Time points are indicated above each lane in minutes. (B) Complementation of E478Q RT with E. coli RNase H. 
Reactions were performed as described for panel A. Time points are indicated above each lane in minutes. Input RNA-DNA, DNA primer, and RNase H cleavage 
products are indicated by arrows. 



RT that the 5-base overlap between donor and acceptor was 
sufficient for strand transfer to proceed. 

E478Q RT in a polymerase-independent assay. Previous 
characterization of E478Q indicated that it lacks directional 
RNase H processing (10) during polymerization. However, the 
question remained whether E478Q RT can perform any poly- 
merase-independent RNase H cleavages. To test this, a series 
of short, truncated substrates were generated (Fig. 5A). If 
RNase H cleavage occurred at the initial (-1) position (indi- 
cated by the arrow) (51, 53), substrate binding could not be 
dictated by the positioning of the 3 'OH in the polymerase 
active site. These RNA substrates contained the same 5'-ter- 
minal truncations as the RNAs in the strand transfer assay. 
Additionally, these substrates truncated the DNA in the RNA- 
DNA oligonucleotide to 5 nucleotides (Fig. 5A). The RNA- 
DNA strands were hybridized to DNA oligonucleotides 17, 14, 
and 11 nucleotides in length, yielding blunt-ended substrates. 

Figure 5B illustrates these substrates assayed with HIV-1 
RT or E478Q RT. Figure 5B, lanes 1 to 5, 6 to 10, and 11 to 15 
represent HIV-1 RT assayed with the truncated substrates. For 
the 17-mer (lanes 1 to 5), 14-mer (lanes 6 to 10), and 11-mer 
(lanes 11 to 15) substrates, the correct cleavage products at the 
-1 position were observed, as indicated by the arrows. These 
cleavage events have been extensively characterized for HIV-1 



RT and an isolated HIV-1 RNase H domain with a related 
substrate containing a complementary oligonucleotide carrying 
the entire 18-nucleotide PBS sequence (49). The 17-mer, 14- 
mer, and 11-mer substrates assayed with E478Q RT are shown 
in Fig. 5B, lanes 16 to 20, 21 to 25, and 26 to 30, respectively. 
E478Q RT was capable of cleaving the 17-mer and 14-mer 
constructs at the predicted -1 position. In contrast to WT RT, 
E478Q RT did not cleave the 11-mer construct. These results 
indicate that on defined substrates, E478Q RT is capable of 
polymerase-independent cleavages. 

RNase H cleavage analysis of the truncated DNA substrates. 
E478Q RT-RNase H recognized and cleaved the blunt 17-mer 
and 14-mer RNA-DNA hybrid substrates (Fig. 5B) but was 
inactive on the equivalent 52-mer (A6) and 49-mer (A9), RNA- 
DNA hybrid resulting after polymerization (Fig. 3). Two key 
differences between these substrates are the size of the DNA- 
DNA hybrid segment and the position of a 3 'OH group with 
respect to the RNA-DNA hybrid. A large DNA substrate may 
lock the 3 'OH within the polymerase active site, whereas a 
small substrate may have sufficient flexibility to permit binding 
of the substrate into the RNase H active site. 

To address this, a series of substrates were constructed 
which varied the length of the double-stranded (ds) U5 DNA 
associated with the tRNA-DNA mimic (Fig. 6A). The sizes of 
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FIG. 5. E478Q RT assayed in polymerase-independent assay. (A) The substrates utilized are illustrated. Substrates were labeled at the 5' termini with [7- 32 P]ATP 
and prepared as described in Materials and Methods. An asterisk indicates the radiolabel. The RNA portion is indicated in bold. (B) HIV-1 RT assayed with 17-mer 
(lanes 1 to 5), 14-mer (lanes 6 to 10), 11-mer (lanes 11 to 15); E478Q RT assayed with 17-mer (lanes 16 to 20), 14-mer (lanes 21 to 25), and 11-mer (lanes 26 to 30). 
Time points are 0, 2, 5, 15, and 30 min for each set of lanes. Reactions were performed as described in Materials and Methods. 



the ds DNAs ranged from those that were cleaved, (17-mer; 
5-mer DNA plus 12-mer RNA) to those not recognized by 
E478Q (substrate B; 32-mer DNA plus 12-mer RNA). E478Q 
RT was capable of recognizing and cleaving a 50-mer RNA- 
DNA hybrid substrate containing the intact 18-mer RNA por- 
tion plus a 32-mer of DNA (substrate A). However, when the 
RNA portion of the substrate was truncated by 6 nucleotides 
(substrate B), RNase H cleavage was inhibited. Substrates 
were constructed in which the DNA-DNA (ds DNA portion) 
size varied but the size of the RNA-DNA hybrid remained 
constant (12-mer RNA). Substrates C, D, E, and F possessed 
20, 14, 11, and 7 nucleotides, respectively, of ds DNA. The 
17-mer (5-mer DNA plus 12-mer RNA) and 14-mer substrates 
(5-mer DNA plus 9-mer RNA) were shown to be cleaved by 
E478Q RT (see above) (Fig. 5B). 

HIV-1 RT and E478Q RT were assayed for their abilities to 
cleave these substrates in a polymerase-independent assay 
(Fig. 6B and C, respectively). For HIV-1 RT, all of the newly 
constructed substrates (substrates B to F) were cleaved, indi- 
cating that the substrates were intact RNA-DNA hybrids (Fig. 
6B). Since all of the substrates contained 12 nucleotides of 
RNA, the RNase H cleavage sites were identical and an 11- 
mer RNA cleavage product was observed for each substrate. In 
contrast, with E478Q RT (Fig. 6C), the only substrate that was 
efficiently cleaved was substrate F, which possessed 7 nucleo- 
tides in the DNA-DNA portion and 12 nucleotides in the 



RNA-DNA hybrid portion of the substrate. The total size of 
this substrate was 19-mer (7-mer DNA plus 12-mer RNA), 
which approached that required for polymerase-dependent 
RNase H cleavages. This was only 2 nucleotides larger than the 
17-mer substrate previously cleaved by E478Q RT (Fig. 5B). 
This result indicates that E478Q RT can perform polymerase- 
independent RNase H cleavages only on defined small sub- 
strates. It is of interest that these substrates are equal to or less 
than the length defined between the polymerase and RNase H 
active sites. 

DISCUSSION 

Two different modes of RNase H activity have been charac- 
terized for HIV-1 RT: polymerase dependent and polymerase 
independent (20). Polymerase-dependent RNase H cleavages 
are those that occur while the polymerase domain is actively 
synthesizing. The RNase H active site lags approximately 18 to 
20 nucleotides behind the polymerase active site, which has 
been characterized through footprinting analysis for HIV-1 RT 
and Moloney murine leukemia virus RT (55, 57). On the plus- 
strand strong stop tRNA-DNA, tRNA removal occurs while 
the RT is paused trying to use the modified tRNA residue as 
template. Although the polymerase is not actively synthesizing, 
the initial RNase H cleavage can be viewed as polymerase 
dependent since the nucleic acid substrate remains bound in 
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FIG. 6. RNase H cleavage analysis of truncated DNA substrates. (A) The substrates utilized are illustrated and are labeled A through F. The RNA portions are 
indicated in bold, and an asterisk indicates the radiolabel. The substrates were prepared as described in Materials and Methods. (B) Substrates B to F assayed with 
HIV-1 RT. Reactions were performed as described in Materials and Methods. Time course reactions are shown, and time points are indicated above each lane, along 
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performed as described in Materials and Methods. Time course reactions are shown, and time points are indicated above each lane, along with the substrate utilized. 
The RNase H cleavage product is designated and indicated by an arrow. 



the polymerase active site. The initial RNase H cleavage occurs 
at the penultimate nucleotide of the tRNA (53). Polymerase- 
independent cleavages are less well understood and may rep- 
resent different types of cleavages. These cleavages result in 
the production of smaller products (22) and have been defined 
as "directional processing of the RNA primer" (10). Although 
secondary to the initial cleavage during reverse transcription, 
these events are required for the ultimate release of the RNA 
primer. Mutants with defects in the ability to perform this 
function are unable to perform strand transfer (10, 21, 23, 50). 
Polymerase-independent RNase H cleavages require a second 
binding event by RT (23). This "rebinding" event may require 
a change between the type of conformation required for active 
DNA synthesis and polymerase-dependent RNase H activity. 



We have developed an in vitro assay which requires poly- 
merase-independent RNase H activity during the second- 
strand transfer reaction. Previously, we utilized substrates 
which mimic the U5/PBS border, resulting from plus-strand 
strong stop DNA synthesis. The original substrates possessed 
either an 18-mer RNA oligonucleotide or the intact tRNA Lys>3 
(50). With those substrates, the RNase H activity we observed 
was most probably a combination of polymerase-dependent 
and -independent cleavages. The substrates utilized in this 
study have truncations in the 5' RNA portion and have a 
maximum length of 12 ribonucleotides. Therefore, cleavage 
of these substrates would require polymerase-independent 
RNase H activity, due to the suboptimal distance between the 
polymerase and RNase H active sites. 
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A model illustrating the possible binding conformations 
adopted by WT RT and E478Q RT is shown in Fig. 7. This 
model is based on the crystal structure of a covalently trapped 
catalytic complex of RT with DNA (24) (Fig. 7D, left panel). In 
this complex, the ds DNA spans 23 nucleotides and approxi- 
mates the 12-mer RNA plus 11-mer DNA substrates used in 
the present study. In Fig. 7D, left panel, the substrate is posi- 
tioned in the polymerase active site (labeled Pol). In Fig. 7D, 
right panel, the substrate was modified such that a potential 
RNA strand (red) is within the RNase H active site (RH) and 
extended 12 nucleotides toward the polymerase active site. 
This substrate lacks the stabilizing contacts within the thumb 
domain. The extension of the substrate exiting the RNase H 
domain was not modeled due to limited structural data on 
these contacts. Figure 7A to C utilize a schematic of this 
molecular model to summarize the results presented in this 
study. On a full-length model substrate (50-mer: 32-mer DNA 
plus 18-mer RNA) (Fig. 7A), both E478Q RT and WT HIV-1 
RT bind in a polymerase-dependent conformation and pro- 



duce the well-characterized -1 RNase H cleavage product 
(depicted as a nick within the RNA [thick bar]) (50). The size 
of the RNA portion of the substrate allows correct positioning 
of the RNase H active site near the RNA-DNA junction. This 
RNase H cleavage event is driven by the polymerase domain 
and is guided by the 5' phosphate of the RNA and the 3' OH 
of the DNA (39). 

Figure 7B represents the truncated substrate (44-mer: 32- 
mer DNA plus 12-mer RNA). This substrate is suboptimal for 
a polymerase-dependent mode of binding due to its truncation 
in the 5' portion of the RNA. Despite this, the WT RT main- 
tains its RNase H cleavages on these substrates. This implies 
that it is capable of the release and rebinding of these sub- 
strates (23). E478Q RT, which is compromised in the RNase H 
domain, has regained the single endoribonucleolytic cleavage 
ability only in the presence of Mn 2+ (10). E478Q RT binds in 
the same polymerase-dependent orientation; however, RNase 
H cleavage never occurs because the RNase H active site is 
never positioned near the RNA-DNA junction on this sub- 
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strate. It appears that E478Q in the presence of Mn 2+ has not 
maintained the ability to communicate the release and subse- 
quent rebinding to these large substrates, which possess a 
larger portion of ds DNA than the smaller substrates do. Al- 
ternatively, catalysis for the polymerase-independent cleavages 
may be slower than dissociation on these substrates (12). In the 
polymerase-dependent reaction catalyzed by E478Q, the 
RNA-DNA hybrid remains accessible to further degradation 
by exogenously added E. coli RNase H. This indirectly implies 
that the enzyme is released from the substrate. Complemen- 
tation with E. coli RNase H is sufficient for subsequent strand 
transfer catalyzed by E478Q RT. 

When presented with a substrate truncated in the DNA 
portion as well as in the RNA portion (19-mer: 7-mer DNA 
plus 12-mer RNA), E478Q RT was capable of performing an 
RNase H cleavage. This substrate (Fig. 7C) is small enough 
that it is not locked into the polymerase active site. Therefore, 
a polymerase-independent or RNase H active site-driven 
cleavage may occur. HIV-1 RT preferentially binds a hetero- 
duplex RNA-DNA region with a greater affinity than it binds 
ds DNA (12), which may be directing the RNase H cleavage. 
However, once the DNA portion was increased to 11 nucleo- 
tides (23-mer: 11-mer DNA plus 12-mer RNA), the substrate 
was again locked into the polymerase active site and no RNase 
H activity was observed. This indicates that the overall sub- 
strate size must be smaller than the distance between the 
polymerase and RNase H active sites (55, 57) in order for 
E478Q RT to perform these smaller polymerase-independent 
RNase H cleavages. The E478Q mutation may alter the ability 
of the enzyme to translocate across larger substrates to posi- 
tion itself correctly for RNase H cleavage, whereas a smaller 
substrate can slide into the RNase H active site. 

Several factors have been identified which contribute to the 
positioning of the template. These include the position of the 
primer 3'OH, the position of the 5' phosphate of the RNA, 
and the minor groove binding tract within the thumb sub- 
domain. We propose that with substrates longer than 19 nu- 
cleotides, a polymerase-dependent conformation dominates, 
whereas with the smaller substrates, binding to either the poly- 
merase or RNase H active sites may occur. This may be a 
combination of the high affinity of the polymerase domain for 
the termini of the substrate and a result of the defect in the 
RNase H domain of E478Q RT. With E478Q, subsequent 
rebinding to the substrate may remain driven solely by the 
recognition of the termini of the substrate. Rebinding (23) 
would occur in the same location and would never position the 
RNA-DNA substrate in a conformation required for a poly- 
merase-independent cleavage. It is therefore surprising that 
the smaller substrates are capable of binding E478Q within the 
RNase H domain. This implies that there may be a stabilizing 
effect of having a substrate exit the RNase H domain. Reports 
have suggested that substrate binding by the RNase H domain 
contributes to processive DNA synthesis (8). 

Studies have been performed which show that the 5' end of 



the RNA sequence can be responsible for directing its cleavage 
by HIV-1 RNase H (39, 40). Those studies suggested that RT 
is capable of binding the substrate in two different ways: either 
based on the 3' end of the DNA strand or based on the 5' end 
of the RNA strand. The binding was dependent on the sub- 
strate structure and on whether the DNA or RNA ends were 
recessed. When binding was driven by the position of the RNA 
5' phosphate, RNase H cleavage still occurred 18 nucleotides 
upstream, maintaining the optimal spatial distance between 
the two active sites. However, with the substrates utilized in 
our study, binding of the 5' phosphate within the polymerase 
active site would position the RNase H active site within the ds 
DNA region. WT RT is able to overcome this obstacle; there- 
fore, this cannot be the dominant feature, since cleavage at the 
penultimate nucleotide within the RNA was consistently ob- 
served. 

Another contributing factor may be the role of the p66 
thumb subdomain in positioning the RNA-DNA substrate. In 
the crystal structure of HIV-1 RT bound with nonnucleoside 
inhibitor, the position of the thumb differs from that of the 
inhibitor free enzyme (11, 15, 16, 45). The a-helix H within the 
thumb acts as a minor groove binding tract (5, 6). This helix 
plays a role in the binding of primer-template complexes. The 
RT mutant W266A (mutated within a-helix H) lost the ability 
to position the 3'OH/5' phosphate within the polymerase ac- 
tive site and resulted in imprecise removal of the polypurine 
tract primer (43). Molecular modeling of a truncated substrate 
in the RNase H active site precluded the interaction of a 
12-base extension with the minor groove binding tract (Fig. 
7D, right). Flexibility of the thumb and/or perturbation of the 
size of the major and minor groove of the substrate could alter 
these interactions. Cooperativity of the thumb with either the 
polymerase or RNase H active sites could therefore be a dis- 
criminating factor for the positioning of the tRNA-DNA sub- 
strate. 

We propose two different binding conformations for HIV-1 
RT in removing the tRNA, a polymerase-dependent binding 
mode and a polymerase-independent binding mode. Differen- 
tial conformations for endonuclease and exonuclease RNase H 
activities were previously described (60). Another study dem- 
onstrated that in the switch from initiation to elongation, the 
RT must dissociate before efficient and processive elongation 
can occur (31). That study also concluded that the length of the 
synthesized DNA affects this switch, since it would change the 
proximity of the RNA/DNA junction on the primer strand 
relative to a-helix H. The truncated substrates utilized in our 
present study would place the RNase H active site within the 
U5 DNA portion once synthesis is complete; therefore, disso- 
ciation would have to occur for any RNase H cleavage to 
occur. HIV-1 RT has no defects in its ability to perform this 
switch and to cleave truncated substrate intermediates. How- 
ever, the E478Q RT was unable to carry out these cleavages on 
substrates that were identical in size. 

The present study has also determined that an overlap of 5 



FIG 7 Model of WT RT and E478Q RT polymerase-dependent and polymerase-independent RNase H activities. The possible binding orientations for HIV-1 RT 
and E478Q RT are shown. (A to C) WT RT and E478Q RT are shown positioned on the various substrates, 50-mer substrate (32-mer DNA, 18-mer RNA) (A), 44-mer 
substrate (32-mer DNA, 12-mer RNA) (B), and 19-mer (7-mer DNA, 12-mer RNA) (C). The RNA portion is indicated by the thick ine, and the DNA is indicated 
by the solid black line. The position of RNase H cleavage is indicated by a nick in the RNA strand. The WT and E478Q RT can be distinguished by the presence of 
an R (WT) or E/Q (E478Q RT) in the RNase H domain. Additionally, the thumb and polymerase domains are indicated by T and P respectively. The 5 phosphate 
fs indicated as well as the size of the RNA on each model substrate. (D) Models of substrates bound in the polymerase active site (left) and the RNase H active site 
riehOk HIV-1 RT The electrostatic potential mapped on the molecular surface rendering of the HIV-1 RT (GRASP [38]) is shown with the template-primer as 
bound in the structure reported by Huang et al. (24) (lrtd), and substrates are shown as stick models. Positively charged amino acids are shown n, iblue and neg a ively 
charged amino acids are shown in red. In the right-hand panel, the substrate found in the lrtd structure has been truncated to include only 

extending from the RNase H active site. This truncated substrate makes very limited interactions with the thumb. T, thumb; F, fingers; RH, RNase H active site, Pol, 
polymerase active site. 
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We have examined amino acid substitutions at residues 115 and 
116 in the reverse transcriptase (RT) of HIV-1. A number of prop- 
erties were examined, including polymerization and processivity 
on both DNA and RNA templates, strand displacement ribonucle- 
otide misincorporation, and resistance to nucleoside analogs. The 
RT variants Tyr-115-Phe and Phe-116-Tyr are similar to wild-type 
HIV-1 RT in most, but not all, respects. In contrast, the RT variant 
Tyr-115-Val is significantly impaired in polymerase activity com- 
pared with wild-type RT; however, Tyr-115-Val is able to incorpo- 
rate ribonucleotides as well as deoxyribonucleotides during poly- 
merization and is resistant to a variety of nucleoside analogs. 

The reverse transcriptase (RT) of HIV-1 is essential for the 
replication of the virus (for a review, see ref. 1) and is an 
important target for antiviral therapy. Two major classes of RT 
inhibitors, nucleoside analogs and nonnucleoside inhibitors, have 
been discovered. However, mutations within RT engender resis- 
tance to both classes of inhibitors (for reviews, see refs. 2-4). To 
understand in greater detail how HIV-1 RT is able to mutate to 
render nucleoside analogs less effective, the nucleotide binding site 
at the polymerase active site must be studied in greater detail. 

Based on the available crystal structures of Moloney murine 
leukemia virus (MMLV) RT and HIV-1 RT, models have been 
developed for nucleotide binding sites of these two related enzymes. 
In the available models (5), the amino acids Phe-155 (MMLV RT) 
and Tyr-115 (HIV-1 RT) interact with the deoxyribose of the 
incoming dNTP. The structure of a ternary complex of HIV-1 RT, 
a double-strand DNA, and a bound dTTP now has been deter- 
mined (6) and shows that, as predicted (5), Tyr-115 interacts directly 
with the deoxyribose of the incoming triphosphate. 

In analyzing mutations at Tyr-115, several groups have found 
that the only amino acid substitution that has little deleterious 
effect on the enzyme is Tyr-115-Phe (7-14). Substitution of 
Tyr-115 with polar amino acids is extremely deleterious (9-13) 
whereas substitution with hydrophobic amino acids usually has 
an intermediate effect (7, 8, 11, 12, 14). Recently, the Tyr-115- 
Phe mutation, along with the mutations Lys-65-Arg, Leu-74- 
Val, and Met-184-Val, has been detected in HIV-1 RTs resistant 
to the carbocyclic nucleoside analog 1592U89 (abacavir) (15, 
16). The effects of amino acid substitutions at residue 116 have 
been less well characterized (7, 8), although the amino acid 
substitution Phe-116-Tyr is one of the mutations present in an 
RT variant resistant to multiple dideoxynucleotides (17-19). In 
this study, we have analyzed the effects of mutations Tyr-115- 
Phe, Tyr-115-VaI, and Phe-116-Tyr in the nucleotide-binding 
pocket on the activity of HIV-1 RT. 

Some of the substitutions we have chosen to make in HIV-1 
RT represent amino acids found at the equivalent positions in 
other viral polymerases. As has already been mentioned, there 
is a phenylalanine in MMLV RT at the position equivalent to 
position 115 of HIV-1 RT (5), and a tyrosine in hepatitis B RT 
at the position equivalent to position 116 of HIV-1 RT (20). We 
also wanted to compare the effects of introducing valine at 
position 115 of HIV-1 RT with the MMLV RT mutant with 
valine at position 155 (21, 22). 



Materials and Methods 

Preparation of p66/p51 Heterodimers. BspMl cassette mutagenesis 
was used to introduce mutations into the p66 coding region (7, 
23). The DNA segments encoding the "His-tagged" RT variants 
were cloned into the vector S-D prot, and the clones were then 
introduced into the Escherichia coli strain BL21 (DE3) pLysE 
(24). The resulting heterodimers were purified by metal chela- 
tion chromatography (24). 

Polymerase and Inhibitor Assays. Wild-type RT and the variant 
RTs were analyzed for polymerase activity by using a variety of 
different template-primer substrates (Fig. 1). As described (26), 
we constructed a series of plasmids containing the polypurine 
tract (PPT, 5'-AAAAGAAAAGGGGGGGGA-3'), U3, R, U5, 
and the primer binding site (PBS, 5'-GTCCCTGT- 
TCGGGCGCCA-3') from the proviral clone pNL 4-3 (25) in 
the vectors Litmus 28 and Litmus 29 (Not). 

For each enzyme sample to be assayed, 0.5 u,g of single-strand 
DNA and 0.1 ul of 10 OD26o/ml oligonucleotide were hybridized. 
The mixture was adjusted to contain 25 mM Tris-Cl (pH 8.0), 75 
mM KC1, 8.0 mM MgCl 2 , 2 mM DTT, 1 ug/ml acetylated BSA, 10 
mM CHAPS (3-[(3-cholamidopropyl)dimethylammonio]-l- 
propanesulfonate), 10 uM each of dATP and dTTP, 5 uM each of 
dGTP and dCTP, and 0.5 ul each of [o^PJdGTP and [cr^dCTP 
in a final volume of 100 ul The reaction was initiated by the 
addition of 0.1 u.g of enzyme and was allowed to proceed for 1 h. 
The reactions were terminated by the addition of 10 ul of 10 mg/ml 
sheared, denatured salmon sperm DNA, and 3 ml of ice-cold 100% 
trichloroacetic acid (TCA). The precipitate was collected on What- 
man GF/C glass filters and counted. 

The RNA-dependent DNA-dependent polymerase (RDDP) 
assays used four template-primer combinations: (i) 
poly(rC>oligo(dG) obtained from Amersham Pharmacia; («) 
poly(rA)-oligo(dT) obtained from Amersham Pharmacia; (Hi) 
PPT-PBS antisense RNA hybridized to the PPT DNA oligonu- 
cleotide; and (iv) PPT-PBS sense RNA hybridized to the PBS 
DNA oligonucleotide. The last two RDDP assays are similar to 
the assays described previously (23) except that RNase-free 
reagents and 1 unit/ul RNase inhibitor (Ambion) were used. 

Antisense RNA was generated from the 5pel-linearized plas- 
mid PPT-PBS(Tn) by using the T7 MEGAscript kit and protocol 
(Ambion) as described (26); sense RNA was generated by using 
the Mrtl-linearized plasmid PPT-PBS(A n ). The full-length 
RNAs generated from these plasmids have poly(A) tails and 
were purified by using the PolyAT tract system (Promega). 



Abbreviations: RT, reverse transcriptase; MMLV, Moloney murine leukemia virus; DO DP, 
DNA-dependent DNA polymerase; RDDP, RNA-dependent DNA polymerase; PPT, polypu- 
rine tract; PBS, primer binding site. 
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Fig. 1. Polymerase activity of the RT variants relative to wild-type RT. The 
level of radioactivity incorporated by wild-type RT represents 100% activity 
and the level of radioactivity incorporated by the RT variants is normalized to 
this value. The template primers used are described in Materials and Methods. 
(A) RDDP activities. (B) DDDP activities. 



The inhibition of wild-type RT and the RT variants by the 
nucleoside analogs ddlTP and ddGTP was assayed by using 
poly(rC)-oligo(dG) as the template-primer whereas the nucleo- 
side analogs 3'-azido-3'-deoxythymidine 5'triphosphate 
(AZTTP) (Moravek Biochemicals, Brea, CA) and ddTTP were 
tested by using the template-primer poly(rA)*oligo(dT). Inhibi- 
tion by (-)-j3-L-2',3 , -dideoxy-3'-thiacytidine triphosphate 
(3TCTP) (Moravek Biochemicals) was assayed by using single- 
strand M13mpl8 DNA (New England Biolabs) hybridized to the 
sequencing primer -47 (New England Biolabs). 

Processivity and Strand-Displacement Assays. The processivity assay 
was done as described (27, 28). For the strand displacement 
assay, an end-labeled PBS DNA oligonucleotide was hybridized 
to single-strand PPT-PBS sense DNA generated from PPT-PBS 
L-28 along with a 10-fold excess of four unlabeled DNA oligo- 
nucleotides. There was a 10-nt gap between the 3' end of the 
labeled PBS oligonucleotide and the 5' end of the first unlabeled 
DNA oligonucleotide; the four unlabeled DNA oligonucleotides 
hybridize to regions of the HI V-l long terminal repeat upstream 
of the PBS region separated by 3-nt gaps. 

The RT reaction conditions were similar to those described 
above, except that the buffer conditions included either 75 mM 
or 35 mM KC1 and that mixture was incubated at 37°C for 30 min. 
The reactions were halted by extraction with phenol/chloroform 
and the nucleic acid was precipitated with isopropanol, the 
sample was resuspended in 10 /xl of loading dye, and 4 pi was 
loaded on a 6% sequencing gel. After electrophoresis, the gel 
was exposed to x-ray film. 

UTP Misincorporation. The UTP misincorporation assay was sim- 
ilar to the DNA-dependent DNA polymerase (DDDP) assays 
described above, except that the reaction conditions included 
10 jtM each of dATP, dGTP, and dCTP, 1 fiM dTTP, and 2 i±\ 
of 800 Ci/mmol [a 22 P]UTP (1 /utM final concentration). After a 
30-min incubation at 37°C, the reaction mixture was passed 
through Sephacryl S-200-h, extracted with phenol/chloroform, 
and precipitated with isopropanol. The sample was resuspended 
in 10 yA of loading dye and 4 /il was loaded on a 6% sequencing 
gel. After electrophoresis, the gel was exposed to x-ray film. 



dGTP and GTP Kinetics. The dGTP kinetic assay was done as 
described (26, 28) except that 20 y\ of 1.0 mM dGTP solution was 
added to 177 /d of ddH 2 0 and 3 fi\ of 800 Ci/mmol [a 32 P]dGTP 
to make a final concentration of 100 pM. The GTP kinetic assay 
was done in a similar manner except that radioactive 100 jxM 
GTP mixture was made by mixing 20 /ml of 1 mM GTP with 170 /Ltl 
of ddH 2 0 and 10 jxl of 3,000 Ci/mmol [a 32 P]GTP. The reaction 
time also was increased from 10 min to 30 min because GTP is 
incorporated less efficiently than dGTP. 

Results 

Polymerase Assays. In the RDDP assay, RT variants Tyr-115-Phe 
and Phe-116-Tyr had RDDP activity slightly higher than that of 
wild-type RT whereas the RDDP activity of Tyr-115-Val was 
approximately 75% the level of wild-type RT (Fig. IA). These 
results are in agreement with the results we obtained with p66 
homodimers (7, 8). RDDP activity also was measured with a PPT 
DNA oligonucleotide hybridized to antisense RNA and with a PBS 
DNA oligonucleotide hybridized to sense RNA. As shown in Fig. 
I A, the RT variants Tyr-115-Phe and Phe-116-Tyr had activity 
levels with these substrates, relative to wild-type HI V-l RT, that 
were similar to their relative activity with poly(rC)-oligo(dG). 
However, Tyr-115-Val had a relative RDDP activity with the 
HIV-1 RNAs (both sense and antisense) that was significantly 
lower than the relative RDDP activity obtained with 
poly(rC)-oligo(dG). One possible explanation for this discrepancy 
is that Tyr-115-Val is able to use homopolymeric templates 
more efficiently than heteropolymeric templates. We tested this 
possibility by using another homopolymeric template: 
poly(rA)-oligo(dT). With poly(rA>oligo(dT), the RT variants Tyr- 
115-Phe and Phe-116-Tyr had relative RDDP activities similar to 
the RDDP activity obtained with other substrates (Fig. IA). 
Although Tyr-115-Val had a higher relative RDDP activity with 
poly(rA)*oligo(dT) than with either the sense or the antisense 
HIV-1 RNA templates, the relative RDDP activity was still signif- 
icantly lower than that measured with poly(rC)-oligo(dG) (Fig. 1A). 

Another possible explanation is that Tyr-115-Val might have 
a lower fidelity than wild-type RT and occasionally might 
incorporate an incorrect dNTP into the growing primer strand 
while using the heteropolymeric template. Such a misincorpo- 
ration might cause the enzyme to stall at the misincorporated 
base. Only one nucleotide (dGTP) is present when 
poly(rC)-oligo(dG) is the template primer, so there would be 
neither misincorporation nor stalling. To test this possibility, we 
repeated the assay with poly(rC)-oligo(dG) except with equal 
amounts of all four dNTPs in the reaction mixture. The level of 
radioactive dGTP incorporated into poly(rC)-oligo(dG) re- 
mained the same whether or not the other dNTPs were present 
(data not shown), suggesting that misincorporation is not re- 
sponsible for the differences in the relative RDDP activity 
obtained with poly(rC)«oligo(dG) and the other substrates. 

The DDDP activity of the RT variant Tyr-115-Phe was similar 
to that of wild-type RT for all three substrates (Fig. IB), whereas 
Phe-116-Tyr had relative DDDP activity levels that were higher 
than wild-type RT with two of the substrates (Fig. IB). Tyr- 
115-Val was significantly impaired with all of the DDDP sub- 
strates when compared with wild-type RT (Fig. IB). 

Processivity. The results of the processivity assays with the PBS 
oligonucleotide and a sense HIV-1 DNA substrate are shown in Fig. 
IA. The RT variant Tyr-115-Phe had the same relative processivity 
as wild-type RT whereas Phe-116-Tyr was more processive and 
Tyr-115-Val was significantly less processive relative to wild-type 
RT. In the RNA genome of HIV-1, there are two well- 
characterized secondary structures located in the R-U5 region: the 
Tar element (1) and the poly(A) hairpin (34). These secondary 
structure elements, which also should be present in single-strand 
sense DNA, might cause HIV-1 RT to pause and increase the 
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Fig. 2. Processivity assays with DNAand RNA templates. End-labeled primers 
were annealed to template DNA and RNA and extended in the presence of a 
cold trap that prevents multiple rounds of polymerization. (A) Processivity 
with the PBS DNA oligonucleotide annealed to sense PPT-PBS DNA. (B) Pro- 
cessivity with the PBS DNA oligonucleotide annealed to sense PPT-PBS RNA. 
The locations of the 3' ends of two known secondary-structural elements, TAR 
(for a review, see ref. 1) and the poly(A) hairpin (34), are shown at the right. 
Duplicate assays, labeled 1 and 2, were done for each sample. WT, wild type. 

probability of disassociation from the template primer. As shown in 
Fig. 2A, there are a series of extension products whose length 
corresponds to the 3' end of the poly(A) hairpin, which is the 
element that the extending polymerase encounters first. There are 
some extension products that appear to terminate near the 3' end 
of the TAR element; however, there are fewer of these products 
than those that correspond to the 3' end of the poly(A) hairpin (Fig. 
2/4). Similar results were obtained when the processivity substrate 
was a DNA oligonucleotide annealed to single-strand M13mpl8 
DNA (data not shown). With a PPT oligonucleotide and an 
antisense HIV-1 DNA substrate, both Tyr-1 15-Phe and Phe-116- 
Tyr appeared to be slightly more processive and Tyr-115-Val was 
significantly less processive (data not shown). 

The reported K m for Tyr-115-Val for dTTP (62.7 /xM) is 
significantly higher than wild-type RT (6.7 M), whereas Tyr-115- 
Phe has a reported K m for dTTP of 3.0 jiM (12). Although a dTTP 
concentration of 10 /iM, which normally is used in the processivity 
assay, should be sufficient for wild-type RT, this concentration 
might be limiting for Tyr-115-Val because dTTP is present at a 
concentration approximately one-sixth of the reported K m value. 
Therefore, we repeated the processivity assay by using a DNA 
oligonucleotide annealed to single-strand M13mpl8 DNA but with 
the concentration of each nucleotide at 50 fiM. The processivity for 
wild-type RT, Tyr-1 15-Phe, and Tyr-115-Val was not significantly 
different when the dNTP concentrations were 50 fxM compared 
with the results with 10 /xM concentrations (data not shown). 

We also analyzed the processivity of the variant RTs on an 
RNA template. A DNA PBS oligonucleotide was hybridized to 
sense HIV-1 RNA (26). Again, Tyr-115-Phe and Phe-116-Tyr 
appear to have slightly higher processivity than the wild-type RT 
(Fig. 2B). Tyr-1 15-Val was significantly impaired compared with 
wild- type RT and the other RT variants. As described above, the 
sense RNA template would contain the TAR (1) and poly(A) 
hairpin (34) secondary structures. Interestingly, the RT exten- 
sion products were somewhat different with RNA and DNA 
templates. As shown in Fig. 2B, the extension products corre- 
sponding to the 3' end of the poly(A) hairpin, the region the RT 
encounters first, were not as intense with an RNA template as 
with a DNA template (Fig. 2A). The region of strongest pausing 
actually occurred before the poly(A) hairpin was reached (Fig. 




Fig. 3. Strand-displacement activity of wild -type (WT) and mutant HIV-1 RTs. 
The PBS DNA oligonucleotide was end-labeled with [-y^PjATP and annealed to 
sense single-strand DNA. A 10-fold excess of unlabeled DNA oligonucleotides 
also was annealed to the sense-strand DNA to provide the targets to be 
displaced. The RT polymerization reaction was done in the presence of two 
different concentrations of KCI: 75 mM KCI (A) and 35 mM KCI (fl). The 5' end 
of the first unlabeled target DNA oligonucleotide (marked at the right) 
indicates the starting point for strand displacement. Duplicate assays, labeled 
1 and 2, were done for each sample. 

2A) and there appeared to be more pausing at the 3' end of the 
TAR element than with a single-strand sense DNA template. 

Strand Displacement. We analyzed the strand-displacement activ- 
ity of RT in the presence of both 75 mM KCI and 35 mM KCI, 
because Fuentes et al. (29) showed that 75 mM KCI (the 
concentration typically used in various RT assays) can inhibit 
strand displacement, presumably by stabilizing the nucleic acid 
duplex. They also showed that decreasing the concentration of 
KCI in the reaction increases the amount of strand displacement 
(14). As shown in Fig. 3L4, 75 mM KCI inhibited strand- 
displacement activity by RT. Most of the PBS primers were 
extended only to the 5' end of the first unlabeled oligonucleotide. 
The small amount of strand displacement by wild-type RT, 
Tyr-115-Phe, and Phe-116-Tyr extended no farther than 100 nt 
from the 5' end of the first unlabeled DNA oligonucleotide. 
Tyr-1 15-Val did not appear to displace the DNA oligonucleo- 
tides to any measurable extent (Fig. 3/1). In contrast, strand 
displacement was significantly increased in the presence of 35 
mM KCI (Fig. 3B)\ wild-type RT extended the labeled PBS DNA 
oligonucleotide at least 200 nt into the double-strand substrate. 
Tyr-115-Phe and Phe-116-Tyr had significant levels of strand- 
displacement activity but did not appear to be as efficient as 
wild-type RT (Fig. 3B). Tyr-115-Val showed only a very low level 
of strand displacement activity even with the lower concentra- 
tion of KCI. 

Ribonucleotide reincorporation. Gao et al (21, 22) showed that the 
amino acid substitution Phe-155-Val in the MMLV RT allows the 
misincorporation of ribonucleotides into the growing primer strand. 
We tested whether mutants of HIV-1 RT with substitutions at 
Tyr-1 15, which is the amino acid equivalent to Phe-155 in the 
MMLV RT (30-32), would cause a similar misincorporation of 
ribonucleotides with a DNA:DNA template primer. Because 
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Fig. 4. UTP misincorporation by wild-type (WD and mutant HIV-1 RTs. An 
unlabeled sequencing primer was annealed to single-strand M13mp18 DNA. RT 
polymerization was done with radioactive [o^PjUTP in the reaction mixture. 
Extension products with UTP incorporation were radioactively labeled and visible 
on x-ray film. Duplicate assays, labeled 1 and 2, were done for each sample. 



[cr^PJUTP was the only radioactive nucleotide in this assay, any 
UTP that was misincorporated would label the primer strand. Note 
that the concentrations of dTTP and UTP (1.0 jitM) were equiv- 
alent in this reaction. As shown in Fig. 4, wild-type RT and 
Phe-116-Tyr did not produce labeled extension products. The level 
of UTP misincorporation, if any, was below the limit of detection 
in this assay. Tyr-115-Val had a high level of UTP misincorpora- 
tion, producing labeled extension products ranging between 67 and 
147 nt in length (Fig. 4). This result suggests that the Tyr-115-Val 
RT variant was able to continue extending the primer after misin- 
corporation of the UTP. Surprisingly, Tyr-115-Phe also appeared 
to misincorporate UTP into the growing primer strand, albeit at a 
much lower level than Tyr-115-Val (Fig. 4). We repeated these 
experiments using [o^PJATP as the labeled ribonucleotide. Again, 
Tyr-1 15-Val incorporated the labeled ribonucleotide to a signifi- 
cant extent (data not shown). However, Tyr-115-Phe did not 
incorporate detectable amounts of ATP. 

Kinetics of Ribonucleotide Incorporation. Using kinetic assays, we 
analyzed the ability of wild-type RT and Tyr-1 15-Val to use 
deoxynucleotides and ribonucleotides. Wild-type RT showed a 
V max with dGTP of 38.8 ± 0.9 pmol incorporated/min and a K m 
of 4.50 ± 0.2 /xM, which is similar to our previous results (26, 28). 
In contrast to wild-type RT, Tyr-115-Val had a slightly lower 
^max (28.1 ± 0.4 pmol incorporated/min) and a higher K m 
(8.61 ± 0.3 yM) with dGTP (data not shown). The combination 
of a lower V max and a higher K m might explain why Tyr-115-Val 
is less processive than wild-type RT (Fig. 2A). 

There were distinct differences between wild-type RT and 
Tyr-115-Val when GTP was the substrate. Wild-type RT had a 
somewhat higher K max than did Tyr-115-Val (0.17 ± 0.01 pmol 
GTP incorporated/min versus 0.10 ± 0.03 pmol GTP incorpo- 
rated/min); the Kmax for either enzyme was much lower for GTP 
than dGTP. There was a larger difference in the K m . Wild-type 
RT had a K m of 23.1 ± 0.10 /xM GTP; Tyr-115-Val had a K m of 
1.47 ± 0.12 jtM GTP (data not shown). 
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Fig. 5. Inhibition curves for various nucleoside analogs. The amount of radio- 
activity incorporated into the template-primer by wild-type and mutant HIV-1 
RTs in the absence of inhibitor represents 100% enzyme activity. The amount of 
radioactivity incorporated at the various concentration of inhibitor is normalized 
to this value. The template primer for testing ddlTp and ddGTP inhibition was 
poly(rC)-oligo(dG), whereas that for ddTTP and AZTTP was poly(rA)-oligo(dT).The 
template-primer for testing (-)-0-L-2\3'-dideoxy-3'-thiacytidine triphosphate 
(3TCTP) inhibition was a sequencing primer annealed to single-strand M13mp18 
DNA. When single-strand M13mp18wasused as a template with the -47 primer, 
the Tyr-1 1 5-Val mutant produced 36% as much DNA product as wild-type HIV-1 
RT; with poly(rA)-oligo(dT) and poly(rC>oligo(dT), the Tyr-1 15-Val mutant pro- 
duced 50-75% as much DNA product. 

Nucleoside Analog Resistance. Because Tyr-115-Val showed a 
decreased ability to discriminate between deoxyribose and ri- 
bose, we wanted to determine whether Tyr-1 15-Val could 
discriminate between normal dNTPs and various nucleoside 
analogs. We tested three dideoxyribonucleotide analogs as well 
as AZTTP and (-)-0-L-2\3'-dideoxy-3'-thiacytidine triphos- 
phate (3TLTP). As shown in Fig. 5, Tyr-1 15-Val was less 
sensitive to all of the nucleoside analogs than was wild-type RT. 
Tyr-115-Phe and Phe-116-Tyr appeared to be slightly more 
sensitive to the compounds than wild-type RT (Fig. 6). Tisdale 
et al (16) showed that a virus containing the Tyr-115-Phe 
mutation is as susceptible as wild-type RT to several different 
nucleoside analogs. The exception is the carbocyclic nucleoside 
analog 1592U89; the resistance of the Tyr-115-Phe variant was 
about twice that of wild-type RT for this analog (16). 

Discussion 

Although the fidelity of HIV-1 RT is lower than that of some DNA 
polymerases, RT does select the proper dNTP to be added to the 
primer strand with considerable accuracy (approximately 1 error 
for every 10 4 bases polymerized) (see ref. 1 for a review). A dTTP 
residue has been modeled into the structure of HIV-1 RT in 
complex with double-strand DNA (5). A model for the structure of 
binding of a nucleoside triphosphate to MMLV RT also has been 
developed (31). Both models propose that the equivalent amino 
acids in the two structures (Tyr-115 in HIV-1 RT, Phe-155 in 
MMLV RT) interact with the ribose of the incoming triphosphate. 
The structure of a ternary complex of HIV-1 RT, double-strand 
DNA, and a bound dTTP (6) confirms these predictions and 
provides a more precise picture of the actual interactions of the 
protein and the triphosphate at the active site (Fig. 6). To explore 
the consequences of altering amino acids in the nucleotide binding 
pocket, we have analyzed the effects of the amino acid substitutions 
Tyr-115-Phe, Tyr-115-Val, and Phe-116-Tyr. 

The effects of an amino acid substitution on HIV-1 RT can be 
subtle and may not be easily detected by using a simple polymerase 
assay with a homopolymeric template primer such as 
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nucleotides between the acceptor DNA and the newly synthe- 
sized plus strand is sufficient for either the WT RT or E478Q 
mutant enzyme to perform the plus-strand transfer. This re- 
gion of overlap includes the 3'OH of the primer. Deletions of 
acceptor molecules which destroy the base pairing with the 
primer terminus block plus-strand transfer in vitro and in vivo 
(4, 54). Our results are consistent with in vivo results in which 
maintenance of 5 nucleotides of the PBS adjacent to U5 plus 
complementarity of only 3 bp at the site of polymerization was 
sufficient for extension of plus-strand DNA during the second 
template transfer (54), This result is also in agreement with the 
occurrence of retroviral recombination events. In vivo, low 
levels of strand transfer events that yield deletions have been 
characterized with minimal overlap (41). Studies have shown 
that there is an average of 1 aberrant strand transfer event per 
replication cycle (29). 

Interestingly, WT HIV-1 RT was capable of performing 
specific RNase H cleavages on all of the substrates utilized. 
Truncation of the DNA or RNA portion did not affect the 
RNase H cleavage site. Rather, all of the substrates produced 
the identical -1 cleavage product. This may be due to the 
strong structural recognition of the tRNA mimic or to an 
alteration of the binding conformation which allows specific 
RNase H cleavages to occur. Previously, we have demon- 
strated that sequences within the first nine positions of the 
tRNA were important for cleavage and recognition by an iso- 
lated RNase H domain (49). These studies support the concept 
that the structure defined by the tRNA sequence possesses 
strong intrinsic signals that lead to its precise cleavage between 
the terminal ribo-A and ribo-C. This cleavage occurs even 
when a large portion of the tRNA Lys * 3 sequence has been 
deleted. 

We have shown that many factors play a role in influencing 
RNase H activity. The in vitro system developed allows the 
biochemical analysis of a minimal system. In vivo, replication 
occurs within reverse transcription complexes (19) in the pres- 
ence of additional viral proteins, including the nucleocapsid, 
which can influence these reactions. The use of an enzyme with 
defects in the RNase H domain provides insight into the re- 
quirements for polymerase-independent RNase H cleavages. 
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Fig. 6. View of the polymerase active site of HIV-1 RT. The sugar ring of the 
incoming dCTP is positioned between the hydrophobic side chains of Tyr-1 1 5 and 
Ala-1 14 atthe back (Van derWaals spheres in purple), and Glu-151 and Arg-72 in 
front (Van derWaals spheres in cyan). Phe-1 16, although too far away to interact 
directly with the sugar of the dCTP, forms the floor of the 3' OH binding pocket 
(Van derWaals spheres in purple). Yellow lines indicate interactions of the 3' OH 
of dCTP with the ring of Tyr-1 1 5, the main-chain NH of Tyr-1 1 5, the main-chain NH 
of Ala-1 14, 0-phosphate of dCTP, and CO of Glu-151. Bridging interactions of the 
NH group of Glu-151 with the guanidinium of Arg-72 and the main-chain CO of 
Lys-73 are shown as green dotted lines. The interactions of the Tyr-1 1 5 ring and 
the Glu-1 51 side chain with the 2' carbon of dCTP are shown as red dotted lines. 

poly(rC)-oligo(dG). Using HIV-1 genome-based templates, Tyr- 
115-Phe is similar to wild-type RT in the RDDP, DDDP, and 
processivity assays (Figs. 1 A and B and 2 A and B). However, it is 
clear in the strand displacement assay that the RT variant Tyr-1 15- 
Phe is impaired relative to the wild-type enzyme (Fig. 3). As will be 
discussed below, Tyr-115-Phe also might have a decreased ability 
to select dNTPs over NTPs. In contrast, the RT variant Phe-1 16- 
Tyr appears to be better than wild-type RT in most of the RDDP 
and DDDP assays (Fig. 1 A and B) and is more processive than 
wild-type RT (Fig. 2 A and B). However, this variant also is impaired 
in the strand displacement assay relative to wild-type RT (Fig. 14). 

The effects of Tyr-115-Val amino acid substitution are not 
subtle, however. With the exception of the template primer 
poly(rC)-oligo(dG), this enzyme is significantly impaired in both the 
RDDP and DDDP assays compared with wild-type RT (Fig. 1 A 
and B). The Tyr-115-Val variant is also significantly less processive 
than wild-type RT with a variety of template-primers (Fig. 2 A and 
B). Tyr-115-Val has been reported to have a much higher K m for 
dTTP (62.7 /xM) than does wild-type RT (6.7 fiM) (12). We found 
that Tyr-1 15-Val has a higher K m for dGTP than does wild-type RT 
(8.61 jxM dGTP versus 4.50 jaM) and a lower K max (28.1 pmol 
dGTP incorporated/min versus 38.8 pmol incorporated/min for 
wild-type RT). However, the K m we measured for dGTP is signif- 
icantly lower than the reported value for dTTP. Taken together, the 
available data suggest not only that the Tyr-1 15-Val RT variant 
binds the incoming dNTP more weakly than wild-type RT but that 
it binds the incoming dNTP improperly: the difference in the Kmax 
shows that increasing the concentration of dNTPs did not com- 
pletely compensate for the improper binding of the dNTP. 

Mispositioning of the incoming nucleotide also might allow 
the RT variant Tyr-115-Val to incorporate NTPs (Fig. 4). The 
equivalent Phe-155-Val mutation in MMLV RT causes a loss of 
discrimination between dNTPs and NTPs rather than a com- 
plete switch of substrate specificity (21, 33). Wild-type MMLV 
RT has a K m for UTP that is much higher than the K m for dTTP 
(443 ptM versus 9.2 juiM). There is a similar effect on K max . Both 
wild-type MMLV RT and the Phe-155-Val mutant have similar 
K max values for dTTP (0.38 and 0.41 /Limol/min per mg, respec- 
tively). The rate for dTTP incorporation is much higher than for 



UTP (L17 versus 5.45 /xmol/min per mg). For the Phe-155-Val 
variant, the K m values are similar (4.34 /xM versus 13.7 /aM for 
the wild-type RT) (21). Our data suggest that the Tyr-115-Val 
variant of HIV-1 RT also has lost substrate specificity. However, 
there are important differences between the results obtained 
with the Phe-155-Val MMLV RT variant and those obtained 
with the Tyr-1 15-Val HIV-1 RT variant. As described above, 
Tyr-115-Val has been reported to have a significantly higher K m 
for dTTP (62.7 /iM) than wild-type HIV-1 RT (6.7 fjM) (12). 
The difference in the dTTP K m values between wild-type MMLV 
RT and the MMLV RT variant is much smaller (21). 

We measured the K m and V max values for wild-type HIV-1 RT 
and the Tyr-1 15-Val variant with both dGTP and GTP. In 
contrast to wild-type MMLV RT, wild-type HIV-1 RT did not 
have a large difference in the K m values between dGTP and 
GTP. The K m for dGTP was 4.50 ttM whereas the K m for GTP 
was 23.1 fiM. The most significant difference was in the K max for 
wild-type RT, which was 38.8 pmol incorporated/min for dGTP 
versus only 0.17 pmol incorporated/min for GTP (data not 
shown). This result suggests that wild-type HIV-1 RT can bind 
GTP, but does so in a fashion that is unfavorable for polymer- 
ization. Interestingly, Tyr-1 15-Val had a somewhat lower V mBX 
with GTP than did wild-type RT (data not shown), yet in the 
UTP misincorporation assay, Tyr-1 15-Val clearly incorporated 
UTP into the extended primer whereas no incorporation was 
detected for wild-type HIV-1 RT (Fig. 4). 

The answer to this apparent paradox might be found in the Jf m 
values. The Tyr-1 15-Val mutant had a low K m value (1.47 /iM), 
whereas the wild-type RT had a significantly higher K m (23.1 
jxM). The level of [a 32 P]UTP in the UTP misincorporation assay 
was low (1 fiM) and was equal to the level of the preferred 
substrate (1 jaM dTTP). If Tyr-115-Val also had a lower K m for 
UTP relative to wild-type RT, the results of the UTP misincor- 
poration might reflect an ability of Tyr-1 15-Val to bind NTPs at 
the polymerase active site better than wild-type RT, rather than 
an increase in the ability to (mis)incorporate NTPs. 

It has been proposed that the presence of a phenyl ring at 
residue 155 of MMLV RT acts directly as a steric gate that 
distinguishes between a 2' H group of a dNTP and a 2' OH group 
of an NTP (22, 28, 31). In this model, the Phe-155-Val mutant 
replaces the bulky aromatic side chain with the smaller side 
group of valine and removes the steric gate. The structure of a 
ternary complex of HIV-1 RT, double-strand DNA, and a dTTP 
(Fig. 6), although different from the proposed model in struc- 
tural detail, suggests that the corresponding amino acid Tyr-115 
plays a role in forming a steric gate in concert with Q151 (5). As 
shown in Fig. 6, the 2' and 3' positions of the incoming dNTP are 
flanked on one side by the Tyr-115 aromatic side chain and on 
the other side by the Q151 side chain. The 2' position of the 
dNTP is closer to the aliphatic side chain and the main-chain 
carbonyl oxygen of Q151 (3.4 and 3.0 A) than to the side chain 
of Tyr-115 (3.5 A). Nevertheless, the sugar puckering of the 
dNTP directs a 2' OH to the ring of Tyr-115 rather than to the 
Q151 side chain. In the Tyr-115Val mutant, the aromatic side 
chain is replaced by a shorter, more rigid, branched residue. This 
side chain would be expected to move toward the dNTP binding 
site because of unfavorable interactions with the main-chain 
carbonyl oxygen of residue 149, a movement that should permit 
the introduction of an OH group at the 2' sugar position. 

Although the mutant Tyr-1 15-Val was able to incorporate UTP, 
ATP, and GTP more efficiently than wild-type RT, the other 
properties of Tyr-1 15-Val suggest that the amino acid at position 
115 has additional roles beyond that of participating in a simple 
steric gate. Tyr-1 15-Val was clearly defective compared with wild- 
type HIV-1 RT in most of our assays. This mutant had a much lower 
processivity activity with both RNA and DNA templates than did 
wild-type RT, and it was unable to cause strand displacement under 
any of the conditions we tested. However, Tyr-1 15-Val RT showed 
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a high degree of resistance to all of the nucleoside analogs that we 
tested. In the absence of a three-dimensional structure of the 
Tyr-115-Val RT variant with a bound nucleotide, it is unclear how 
the variant is able to select the proper dNTP over a nucleoside 
analog more effectively than wild-type HI V-l RT. However, if the 
Tyr-115-Val variant inappropriately positions the incoming dNTP, 
it also might bind the nucleoside analog incorrectly, suggesting that 
there are steric constraints that help the mutant enzyme discrim- 
inate against certain modified sugars. Although the degree of 
resistance to nucleoside analogs is significant, the Tyr-115-Val 
mutation has not been found in HIV-1 patients treated with 
nucleoside analogs. This is presumably a result of the fact that 
mutations like Tyr-115-Val cause sufficient distortion of the poly- 
merase active site such that HIV-1 variants carrying these muta- 
tions cannot be selected. In support of this idea, an MMLV RT 
variant with an Phe-155-Val mutation does not replicate (22). 

Tyr-115-Val is more efficient at extending a mismatched base 
pair than is wild-type HIV RT (12). It should be remembered that 
this assay does not directly measure the fidelity; rather, it measures 
how well the variants can incorporate dNTPs with a less-than- 
optimal template primer configuration at the active site. The 
incoming dNTP can be considered to have two components: a base 
and a ribose phosphate. The overall fidelity of the enzyme depends 
primarily on how well the base of the incoming dNTP is matched 
to the complementary base on the template strand. The mispair 
extension assay primarily involves the ribose phosphate and how 
well the enzyme can add an incoming dNTP to the mispaired base 
at the 3' end of the primer strand. As discussed above, the 
Tyr-115-Val variant may improperly position the incoming dNTP. 
This mispositioning of the dNTP might allow the Tyr-115-Val 
variant to extend a mismatched pair more efficiently than wild-type 
HIV-1 RT, but this result does not necessarily mean the enzyme has 
a lower fidelity than the wild-type enzyme. Although the ability to 
extend a primer to which the wrong base already has been added 
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is a component of fidelity, this event is secondary to the misincor- 
poration of a dNTP. 

It has been suggested that an MMLV containing the Phe-155- 
Val mutation in the RT coding region is defective in its replication 
because ribonucleotides, once incorporated into the growing DNA 
strand, act as inhibitors of DNA synthesis (22). However, the 
incorporation of ribonucleotides does not appear to block DNA 
synthesis by the HIV-1 RT variants Tyr-115-Phe and Tyr«115-Val 
(Fig. 4). It should be noted that the extension products for both 
Tyr-115-Vai and Tyr-115-Phe in the UTP misincorporation assay 
are relatively large. The major extension products for Tyr-115~Val 
range in size between 67 and 147 nt, and the extension products for 
Tyr-115-Phe are more than 300 nt in size. This result suggests that 
both Tyr-115-Val and Tyr-115-Phe are able to extend the primer 
strand after the misincorporation of UTP. 

In summary, the amino acid residue Tyr-115 of HIV-1 RT has an 
important role in the selection of the proper nucleotide during 
polymerization. Although the amino acid substitution Tyr-115-Phe 
has little effect on the enzyme, the mutation Tyr-115-Val has 
deleterious effects on polymerization. The equivalent positions in 
other polymerases also appear to function in the selection of the 
proper nucleotide and in the rejection of inappropriate substrates 
(for a review, see ref. 30). As more is learned about the nucleotide 
binding region(s) of polymerases, it may be possible to engineer 
polymerases with novel substrate specificities (33). 
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